LIDS-P-1006


AFOSR-TR-80-0583

June  1980 


ON  THE  ANALOGY  BETWEEN  MATHEMATICAL  PROBLEMS 
OF  NON-LINEAR  FILTERING  AND  QUANTUM  PHYSICS 


by 


Sanjoy  K.  Mitter 

Department  of  Electrical  Engineering 
and  Computer  Science 
and 

Laboratory  for  Information  and  Decision  Systems 
Massachusetts  Institute  of  Technology 
Cambridge,  Massachusetts  02139  U.S.A. 


(1)  This  research  has  been  supported  by  the  Air  Force  Office  of  Scientific  Research  under  grants  AFOSR  77-3321,  AFOSR  77-3281,  and  AFOSR  77-3281B.  To  appear  in  Ricerche  di  Automatica,  Special  Issue  on  System  Theory  and  Physics.




UNCLASSIFIED

REPORT DOCUMENTATION PAGE

Title: On the Analogy Between Mathematical Problems of Non-Linear Filtering and Quantum Physics
Type of report: Interim
Author: Sanjoy K. Mitter
Performing organization: M.I.T., Lab for Information & Decision Systems, Cambridge, MA 02139
Grant number: AFOSR 77-3281
Program element, project, task: 61102F 2304/A6
Controlling office: Air Force Office of Scientific Research/NM, Bolling AFB, Washington, DC 20332
Security classification: UNCLASSIFIED
Distribution statement: Approved for public release; distribution unlimited.

1.  Introduction. 


The main thesis of this paper is that there are striking similarities between the mathematical problems of stochastic system theory, notably linear and non-linear filtering theory, and mathematical developments underlying quantum mechanics and quantum field theory. Thus the mathematical developments of the past thirty years in functional analysis, Lie groups and Lie algebras, group representations and probabilistic methods of quantum theory can serve as a guide and indicator in the search for an appropriate theory of stochastic systems. In the current state of development of linear and non-linear filtering theory, it is best to proceed by "analogy" and with care, since "unitarity", which plays such an important part in quantum mechanics and quantum field theory, is not necessarily relevant to linear and non-linear filtering theory. The partial differential equations that arise in quantum theory are generally wave equations, whereas the partial differential equations arising in filtering theory are stochastic parabolic equations. Nevertheless the possibility of passing to a wave equation by appropriate analytic continuation from the parabolic equation, reminiscent of the current program in Euclidean field theory, should not be overlooked.

To develop these ideas, it is best to begin with a reasonably general non-linear filtering problem:

Let $(\Omega, \mathcal{A}, P)$ be the underlying probability space and let $x_t$ denote a scalar-valued (1) diffusion process which is the solution of the following Ito stochastic differential equation:

(1.1)  $dx_t = f(x_t)\,dt + g(x_t)\,dw_t$,

where $w_t$ is standard Brownian motion and $f$ and $g$ satisfy appropriate assumptions so that (1.1) has a unique solution in the sense of Ito. We shall refer to (1.1) as the physical process. Let

(1.2)  $z_t = h(x_t)$, with $z_t \in L^2(\Omega, \mathcal{A}, P)$,

denote the signal process, and suppose we observe the signal in the presence of another Wiener process $\eta_t$:

(1.3)  $y_t = \int_0^t z_s\,ds + \eta_t$.

We shall refer to (1.3) as the observation equation. We make the assumption that $x_t$ and $\eta_t$ are independent. Let $\mathcal{F}_t^y$ denote the $\sigma$-field generated by $\{y_s \mid 0 \le s \le t\}$. The problem of non-linear filtering is to recursively compute

(1.4)  $E[\phi(x_t) \mid \mathcal{F}_t^y]$,

where $\phi$, say, is a bounded, continuous function and $E[\phi(\cdot) \mid \mathcal{F}_t^y]$ denotes conditional expectation with respect to $\mathcal{F}_t^y$. The solution to this problem can be obtained by functional integration and the Cameron-Martin-Girsanov formula. Define a new measure $\tilde{P}$ on $(\Omega, \mathcal{A})$ by the Cameron-Martin-Girsanov transformation:

(1.5)  $\frac{dP}{d\tilde{P}} = \exp\left(\int_0^t z_s\,dy_s - \frac{1}{2}\int_0^t z_s^2\,ds\right)$.

Under this new measure, the probability distribution of $x_t$ remains invariant, but $y_t$ and $x_t$ are independent and $y_t$ is standard Brownian motion.

(1) There is no difficulty in generalizing to the vector case.

Let $\pi_t$ denote the conditional expectation operator. Then a standard application of the theory of conditional expectations gives us:

(1.6)  $\pi_t(\phi) = E[\phi(x_t) \mid \mathcal{F}_t^y] = \frac{\tilde{E}\left[\phi(x_t)\,\frac{dP}{d\tilde{P}} \mid \mathcal{F}_t^y\right]}{\tilde{E}\left[\frac{dP}{d\tilde{P}} \mid \mathcal{F}_t^y\right]}$,

where $\tilde{E}$ denotes expectation with respect to the $\tilde{P}$-measure. The mapping $\pi_t$ is defined to be the filter for the stochastic system (1.1)-(1.3). $\pi_t$ itself can be thought of as a measure-valued stochastic process.
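The ratio in (1.6) suggests a direct Monte Carlo approximation, sketched below under stated assumptions. Because the law of $x_t$ is unchanged under the new measure and $x_t$ is independent of $y_t$ there, one may draw independent copies of the signal and weight each by the discretised exponential of (1.5). The Euler-Maruyama discretisation, the function names `simulate_observation` and `filter_estimate`, and the particle count are illustrative choices, not part of the paper.

```python
import math
import random


def simulate_observation(f, g, h, x0, T, n_steps, rng):
    """Euler-Maruyama path of (1.1) together with the increments dy of (1.3)."""
    dt = T / n_steps
    x = x0
    dy = []
    for _ in range(n_steps):
        # observation increment: dy = h(x) dt + d(eta)
        dy.append(h(x) * dt + rng.gauss(0.0, math.sqrt(dt)))
        # signal increment: dx = f(x) dt + g(x) dw
        x += f(x) * dt + g(x) * rng.gauss(0.0, math.sqrt(dt))
    return x, dy


def filter_estimate(phi, f, g, h, x0, T, dy, n_particles, rng):
    """Approximate pi_t(phi) in (1.6) by weighting independent signal copies."""
    n_steps = len(dy)
    dt = T / n_steps
    numerator = 0.0
    denominator = 0.0
    for _ in range(n_particles):
        x = x0
        log_weight = 0.0
        for k in range(n_steps):
            hx = h(x)
            # discretised exponent of (1.5): int h dy - 1/2 int h^2 ds
            log_weight += hx * dy[k] - 0.5 * hx * hx * dt
            x += f(x) * dt + g(x) * rng.gauss(0.0, math.sqrt(dt))
        weight = math.exp(log_weight)
        numerator += weight * phi(x)
        denominator += weight
    return numerator / denominator
```

Calling `filter_estimate` with the constant function 1 returns 1 by construction, which mirrors the normalisation in (1.6). The scheme is only a sketch: the weights degenerate over long horizons, and practical filters resample.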

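For what follows it is convenient to rewrite (1.6) in the form of an input-state-output relation. For this purpose define

(1.7)  $\rho_t(\phi) = \tilde{E}\left[\phi(x_t)\,\frac{dP}{d\tilde{P}} \mid \mathcal{F}_t^y\right]$,

so that

(1.8)  $\pi_t(\phi) = \frac{\rho_t(\phi)}{\rho_t(1)}$,

where 1 denotes the constant function 1 for all x.

In the above $\rho_t$ is to be thought of as the state of the filter and equation (1.8) as the state-output relation of the filter.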

It is instructive to view (1.7) and (1.8) in the light of the Gelfand-Naimark-Segal construction of states and representations. The functions $x_t \mapsto \phi(x_t)$ are the observables of the physical system (1.1). The formula (1.8) computes the conditional statistics of the observables $\phi$ given the observation program $\{y_s \mid 0 \le s \le t\}$. The analogy with the algebraic theory of quantum mechanics is striking, the notable difference being that the idea of computing conditional statistics based on an observation program seems to be absent in physics. This viewpoint turns out to be important in the definition of generalised observables for quantum systems as probability operator-valued measures.

It can be shown that $\rho_t$ has a density $q_t$ which satisfies a stochastic partial differential equation

(1.9)  $dq_t = L_0^* q_t\,dt + L_1 q_t\,dy_t$,

where $L_0^*$ is the formal adjoint of the diffusion process generator of (1.1) and $L_1$ is the operator of multiplication by $h(x)$. This is the Duncan-Mortensen-Zakai equation and is the fundamental equation of non-linear filtering. The density $q_t(z, y_0^t)$ has a representation as a function space integral (1)

(1.10)  $q_t(z, y_0^t) = \int_X \exp\left(\int_0^t h(x_s)\,dy_s - \frac{1}{2}\int_0^t h^2(x_s)\,ds\right) d\mu^z(x)$,

where the integration is over the path space of $x$ with $x_0 = 0$ and $x_t = z$.

Now in equation (1.9) the observation $\{y_s \mid 0 \le s \le t\}$ is given to us, and if we rewrite equation (1.9) in the following suggestive form using Stratonovich calculus:

(1.11)  $\frac{dq_t}{dt} = \left(L_0^* - \frac{1}{2}L_1^2\right) q_t + \dot{y}_t L_1 q_t$,

then equation (1.11) is the analogue of a Euclidean (imaginary time) quantum field with a time-dependent external force defined by the observations. We say that the analogy is to a Euclidean quantum field (as opposed to Euclidean quantum mechanics) since $q_t$ is a measure-valued stochastic process. This paper is concerned with a systematic investigation of this point of view. In particular, in this picture the Kalman filter occupies the role of the free quantum field.
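The passage from the Ito form (1.9) to the Stratonovich form (1.11) can be checked formally as follows. This is a sketch that treats $L_1 q_t$ as a semimartingale whose martingale part is $L_1(L_1 q_t)\,dy_t$ and uses $d\langle y \rangle_t = dt$; the symbol $\circ$ for the Stratonovich differential is notation introduced here.

```latex
\begin{align*}
L_1 q_t \circ dy_t
  &= L_1 q_t\,dy_t + \tfrac{1}{2}\,d\langle L_1 q,\, y\rangle_t
   = L_1 q_t\,dy_t + \tfrac{1}{2} L_1^2 q_t\,dt, \\
dq_t
  &= L_0^* q_t\,dt + L_1 q_t\,dy_t
   = \bigl(L_0^* - \tfrac{1}{2} L_1^2\bigr) q_t\,dt + L_1 q_t \circ dy_t .
\end{align*}
```

Dividing formally by $dt$ and writing $\dot{y}_t$ for the derivative of the observation path gives (1.11).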

This paper is divided into six sections. In section 2 we discuss the relationship between Dirichlet and Schrodinger operators and show that it is possible to associate a stochastic process with the ground state measure of Schrodinger operators. Section 3 is devoted to the Bayes formula for non-linear filtering and describes the Feynman point of view for non-linear filtering. Section 4 is concerned with the construction of Fock space and its role in non-linear filtering. In section 5 we argue that the Lie algebra of operators $\mathcal{L} = \mathrm{LA}\{L_0^* - \frac{1}{2}L_1^2,\ L_1\}$ has an important role to play in non-linear filtering. In section 6 we discuss the question of representation of the filter.

This is a semi-expository paper and we have tried to concentrate on the ideas involved and emphasize a certain point of view. The ideas come from constructive quantum field theory as emphasized by Nelson and Segal, recent developments in system theory and the theory of group representations. It is our hope that this paper will go some way towards pointing out the conceptual depth of stochastic system theory, which is still in its infancy.

(1) That is, as integration over the path space of $x$, conditioned by $x_0 = 0$ and $x_t = z$.

2.  The  Feynman-Kac  Formula,  Dirichlet  and  Schrodinger  Operators.

2.1  Introduction 

Recent work on non-linear filtering theory, Euclidean quantum field theory and the stochastic mechanics of Nelson makes extensive use of the Feynman-Kac formula and the interplay between Schrodinger and Dirichlet operators. In particular, a stochastic process associated with the ground-state measure of Schrodinger operators turns out to be important. In this section we give an account of these ideas and the related theory of hypercontractive semigroups.

2.2  Preliminaries

Throughout  this  paper  integration  with  respect  to  Wiener  measure, 
the  Brownian  bridge  measure  and  the  oscillator  measure  will  be  important. 

The  relation  of  Wiener  measure  to  the  Laplacian  and  the  semigroup  generated 
by  the  Laplacian  will  also  be  important.  For  simplicity  we  shall  be  concerned 
with scalar-valued stochastic processes. There is no difficulty in generalising these ideas to vector-valued stochastic processes.

Let $\Omega = C(\mathbb{R}_+; \mathbb{R})$ and let $W_t : \Omega \to \mathbb{R}$, $\omega \mapsto W_t(\omega) = \omega_t$ (*), be the $t$-th coordinate function. We denote by $\vartheta_t$ the right shift on $\Omega$. If we denote by $\mathcal{F}_t$ the smallest $\sigma$-field with respect to which $\{W_s \mid 0 \le s \le t\}$ are measurable and by $\mathcal{F} = \sigma\{W_t \mid t \ge 0\}$, then $\mathcal{F}$ is the Borel $\sigma$-field of $\Omega$ and on $(\Omega, \mathcal{F})$ there is a unique probability measure $W$ such that $W\{W_0 = 0\} = 1$ and, for $0 \le t_0 < t_1 < \dots < t_n$, the random variables $W_{t_1} - W_{t_0}, \dots, W_{t_n} - W_{t_{n-1}}$ are independent, Gaussian, zero mean, with variance $t_i - t_{i-1}$ for the $i$-th increment. This measure is Wiener measure. For each $x \in \mathbb{R}$, the probability measure $W_x$ is defined by path translation:

$W_x(B) = W(\theta_x^{-1}(B))$, $B \in \mathcal{F}$, where

$(\theta_x \omega)_t = x + \omega_t$, $x \in \mathbb{R}$, $\omega \in \Omega$, $t \ge 0$.

(*) We shall also use the notation $x(t)$ for $x_t$ when dealing with processes.

Alternatively one could have started with a Gaussian process $\{W_t \mid t \ge 0,\ W_0 = 0 \text{ a.s.}\}$ which is zero-mean, has stationary independent increments with $W_t - W_s$ of variance $|t-s|$, and constructed Wiener measure as a path space measure on $C(\mathbb{R}_+; \mathbb{R})$. For a measurable real-valued function $f$, define

(2.1)  $(P_t f)(x) = E_{W_x}(f(W_t))$,

where $E_{W_x}$ denotes expectation with respect to $W_x$. If the right hand side makes sense,

(2.2)  $(P_t f)(x) = \int_{\mathbb{R}} f(y)\,p(t,y;0,x)\,dy$,

where $p(t,y;0,x)$ is the transition density of Brownian motion

(2.3)  $p(t,y;0,x) = \frac{1}{(2\pi t)^{1/2}} \exp\left(-\frac{|x-y|^2}{2t}\right)$,  $t > 0$,  $x, y \in \mathbb{R}$.

It is known that for $p \in [1,\infty[$, $(P_t \mid t \ge 0)$ is a strongly continuous contraction semigroup on $L^p(\mathbb{R})$. Its infinitesimal generator is $\frac{1}{2}\Delta$, where $\Delta$ is the Laplacian and $D(\Delta) = H^{2,p}(\mathbb{R}) = \{f \in L^p \mid \Delta f \in L^p$ in the sense of distributions$\}$.
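As a numerical illustration of (2.2)-(2.3), the sketch below evaluates $(P_t f)(x)$ by the trapezoidal rule. The truncation to a window of ten standard deviations, the grid size and the function names are assumptions made only for this example. A convenient check is $f = \cos$: since $\frac{1}{2}\Delta \cos = -\frac{1}{2}\cos$, one expects $(P_t \cos)(x) = e^{-t/2}\cos x$.

```python
import math


def brownian_density(t, y, x):
    """Transition density p(t, y; 0, x) of (2.3)."""
    return math.exp(-(x - y) ** 2 / (2.0 * t)) / math.sqrt(2.0 * math.pi * t)


def heat_semigroup(f, t, x, half_width=10.0, n=4000):
    """(P_t f)(x) from (2.2), trapezoidal rule on [x - L, x + L], L = half_width * sqrt(t)."""
    L = half_width * math.sqrt(t)
    step = 2.0 * L / n
    total = 0.0
    for i in range(n + 1):
        y = x - L + i * step
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * f(y) * brownian_density(t, y, x)
    return total * step
```

For instance, `heat_semigroup(math.cos, 0.7, 0.3)` can be compared with `math.exp(-0.35) * math.cos(0.3)`, and applying the routine to the constant function 1 checks that the density integrates to one.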

For certain applications we shall need to do integration with respect to conditional Wiener measure. From the properties of Wiener measure and the corresponding transition density of Brownian motion, for $0 < t_1 < \dots < t_m$,

$W_x\{\omega \mid \omega(t_1) \in A_1, \dots, \omega(t_m) \in A_m\} = \int_{\mathbb{R}^m} \prod_{i=1}^m p(x_{i-1}, x_i; t_i - t_{i-1})\,\chi_{A_i}(x_i)\,dx_i$,

where $\chi_A$ is the characteristic function of $A$, $x_0 = x$, $t_0 = 0$, and $p(x,y;t) = p(t,y;0,x)$. We can then construct a measure $W_{x,y;t}$ on the continuous paths on $[0,t]$ with $W_0 = x$, $W_t = y$ with probability 1: for $0 < t_1 < \dots < t_m < t$,

$W_{x,y;t}\{\omega \mid \omega(t_1) \in A_1, \dots, \omega(t_m) \in A_m\} = \int_{\mathbb{R}^m} \left(\prod_{i=1}^m p(x_{i-1}, x_i; t_i - t_{i-1})\,\chi_{A_i}(x_i)\right) p(x_m, y; t - t_m)\,dx_1 \cdots dx_m$.

$W_{x,y;t}$ is called conditional Wiener measure. We have

(2.4)  $\int f(\omega)\,dW_x = \int dy \left(\int f(\omega)\,dW_{x,y;t}\right)$

for $f$ which are functions of the values the path takes on $[0,t]$.

This measure corresponds to the Brownian bridge process, which is the Gaussian process $\alpha_s$, say, on $[0,1]$ with covariance

$E(\alpha_s \alpha_t) = s(1-t)$,  $0 \le s \le t \le 1$.

In terms of this process we can write (2.4) as

(2.5)  $\int f(\omega_s)\,dW_0 = \int f\left(\left(1 - \frac{s}{t}\right)x + \frac{s}{t}\,y + \sqrt{t}\,\alpha_{s/t}\right) p(x,y;t)\,dx\,dy\,d\alpha$,

where $d\alpha$ represents the measure on the path space of $\alpha$.
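The bridge covariance $s(1-t)$ can be checked empirically. The sketch below assumes the standard representation of the Brownian bridge as $W_s - sW_1$ for a Brownian motion $W$ started at 0 (a representation not stated in the text above); the function names and sample sizes are illustrative.

```python
import math
import random


def bridge_path(n_steps, rng):
    """Sample a Brownian bridge on [0, 1] at the grid k / n_steps via W_s - s * W_1."""
    dt = 1.0 / n_steps
    w = [0.0]
    for _ in range(n_steps):
        w.append(w[-1] + rng.gauss(0.0, math.sqrt(dt)))
    return [w[k] - (k * dt) * w[-1] for k in range(n_steps + 1)]


def empirical_covariance(s_index, t_index, n_steps, n_paths, rng):
    """Monte Carlo estimate of the bridge covariance at two grid points."""
    total = 0.0
    for _ in range(n_paths):
        path = bridge_path(n_steps, rng)
        total += path[s_index] * path[t_index]
    return total / n_paths
```

With `n_steps = 10`, indices 3 and 7 correspond to $s = 0.3$ and $t = 0.7$, for which the covariance formula gives $0.3 \times 0.3 = 0.09$; the estimate should agree up to sampling error.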

The two processes which we have dealt with are not stationary Gauss-Markov processes. For this reason it is often important to deal with the oscillator process, which is the family of Gaussian random variables $\{q_t,\ -\infty < t < \infty\}$ with covariance $\frac{1}{2}\exp(-|t-s|)$. It is related to the operator

$L_0 = -\frac{1}{2}\frac{d^2}{dx^2} + \frac{1}{2}x^2 - \frac{1}{2}$,

the Harmonic Oscillator Hamiltonian.

2.3  The  Feynman-Kac  Formula.

Both in filtering theory and in quantum physics we are required to deal with an operator on $L^2(\mathbb{R})$ (say)

$H = -\frac{1}{2}\Delta + V(x)$,

where $\Delta$ is the Laplacian, and we need to compute $\exp(-tH)$, the semigroup generated by $-H$. The Feynman-Kac formula provides us with a representation of the operator $T_t = \exp(-tH)$ as a Wiener integral. The first approach, exemplified in the work of Nelson, uses the Trotter product formula to make sense of the formula but requires information on the self-adjointness of the operator $H$. The second approach proceeds by writing the formula first, proving that it represents a strongly continuous one-parameter semigroup, and then computing its unique infinitesimal generator. The second approach uses probabilistic techniques and turns out to be the more general one.

If the potential is bounded below then a general form of the Feynman-Kac formula can be obtained using martingale methods. For this purpose and for later use we introduce certain probabilistic machinery.

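Let $(\Omega, \mathcal{F}, P)$ be a probability space and $(\mathcal{F}_t,\ t \ge 0)$ a non-decreasing family of sub $\sigma$-fields of $\mathcal{F}$. Let $s \ge 0$ be arbitrary and let $a : [s,\infty[ \times \Omega \to \mathbb{R}$ and $b : [s,\infty[ \times \Omega \to \mathbb{R}$ be bounded progressively measurable functions. For any $f \in C^2(\mathbb{R})$, define $A_t f$ for $t \ge s$ by

(2.6)  $(A_t(q)f)(x) = \frac{1}{2}\,a(t,q)\,\frac{\partial^2 f}{\partial x^2}(x) + b(t,q)\,\frac{\partial f}{\partial x}(x)$.

If $\xi(\cdot,\cdot)$ is any progressively measurable function from $[s,\infty[ \times \Omega \to \mathbb{R}$, then $(A_t(q)f)(\xi(t,q))$ defines another progressively measurable function of $t$ and $q$. Then it can be shown that for any $a$ and $b$ satisfying the above,

(2.7)  $f(t,\xi(t)) - \int_s^t \left(\frac{\partial}{\partial u} + A_u\right) f(u,\xi(u))\,du$

is a martingale relative to $(\Omega, \mathcal{F}_t, P)$ for all $t \ge s$, $\forall f \in C^{1,2}([0,\infty[ \times \mathbb{R})$.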

A process $\xi$ satisfying (2.7) will be referred to as an Ito process with drift $b$ and covariance $a$. The Feynman-Kac formula depends on the following observation.

Let $\xi$ be an Ito process relative to $(\Omega, \mathcal{F}_t, P)$ with drift $b$ and covariance $a$. Then for any progressively measurable function $V : [s,\infty[ \times \Omega \to \mathbb{R}$ which is bounded below and $f \in C^{1,2}([0,\infty[ \times \mathbb{R})$,

(2.8)  $e^{-\int_s^t V(u)\,du} f(t,\xi(t)) - \int_s^t \left(\frac{\partial f}{\partial u} + A_u f - V(u) f\right)(u,\xi(u))\, e^{-\int_s^u V(\sigma)\,d\sigma}\,du$

is a martingale after time $s$.

Now define the operator

(2.9)  $(T_t^V f)(s,x) = \int e^{-\int_s^t V(u)\,du} f(t,\xi(t))\,d\mu_{s,x}(\xi)$,

where $d\mu_{s,x}(\xi)$ denotes the measure on the path space of $\xi$ given $\xi(s) = x$. Then we can show that $T_t^V$ is a one-parameter strongly continuous semigroup and from (2.8) we conclude that its differential generator is $-A_u + V(u)$ (in the sign convention $T_t = \exp(-tH)$ used above). (2.9) is the Feynman-Kac formula. When we specialize to the case when $\xi(t)$ is Brownian motion, then $A_u = \frac{1}{2}\frac{d^2}{dx^2}$ and the formula reads, setting $s = 0$,

(2.9')  $(T_t^V f)(x) = E_{W_x}\left[\exp\left(-\int_0^t V(W(s))\,ds\right) f(W(t))\right]$.

Now this formula turns out to be valid for a much larger class of unbounded potentials and the semigroup $T_t^V$ has strong regularity properties.
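Formula (2.9') lends itself to a simple Monte Carlo sketch, given below under stated assumptions: Brownian paths are discretised on a uniform grid, the time integral of $V$ is approximated by the trapezoidal rule, and the function name `feynman_kac` is introduced only for this illustration. A convenient test case, chosen here and not taken from the text, is $V(x) = \frac{1}{2}x^2$ with $f(x) = e^{-x^2/2}$: a direct computation gives $Hf = \frac{1}{2}f$ for $H = -\frac{1}{2}\Delta + V$, so one expects $(T_t^V f)(x) = e^{-t/2} e^{-x^2/2}$.

```python
import math
import random


def feynman_kac(V, f, x, t, n_steps, n_paths, rng):
    """Monte Carlo estimate of E_x[exp(-int_0^t V(W_s) ds) f(W_t)] as in (2.9')."""
    dt = t / n_steps
    sigma = math.sqrt(dt)
    total = 0.0
    for _ in range(n_paths):
        w = x
        integral = 0.0
        for _ in range(n_steps):
            w_next = w + rng.gauss(0.0, sigma)
            # trapezoidal rule for the time integral of V along the path
            integral += 0.5 * (V(w) + V(w_next)) * dt
            w = w_next
        total += math.exp(-integral) * f(w)
    return total / n_paths
```

The estimate carries both a sampling error, which decreases like the inverse square root of `n_paths`, and a time-discretisation error controlled by `n_steps`.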

(1) This argument is an exercise in Stroock-Varadhan, Multidimensional Diffusion Processes, Springer-Verlag, N.Y., 1979.

The more general class of potentials we consider are:

(H1)  $V$ is a measurable function on $\mathbb{R}$ which can be written as $V = V_1 - V_2$ with $V_2 \ge 0$, $V_2 \in L^p(\mathbb{R})$ for some $p > 1$, and $V_1$ measurable, bounded below, and such that for each compact set $K$ in $\mathbb{R}$ there exists $q(K) \ge 2$ s.t.

$\int_K |V_1(x)|^{q(K)}\,dx < \infty$.

For this class of potentials, using the recent estimate (Berthier-Gaveau)

$\forall t > 0,\ \forall r > 0,\quad K(r,t) = \sup_{x \in \mathbb{R}} E_{W_x}\left(\exp\left(-r\int_0^t V(W_s)\,ds\right)\right) < \infty$

and their refinements, the Feynman-Kac formula holds for the class of potentials governed by (H1). If we define the operator

$T_t(f)(x) = E_{W_x}\left(\exp\left(-\int_0^t V(W_s)\,ds\right) f(W_t)\right)$,

we have

Theorem 2.1. (Carmona) For any $q \in [1,\infty[$ and $t > 0$,

(i)  $T_t$ is a bounded operator on $L^q$ and $\|T_t\| \le K(1,t)$.

(ii)  If $q'$ denotes the conjugate exponent of $q$, then for $f \in L^q$, $g \in L^{q'}$,

$\int_{\mathbb{R}} (T_t f)(x)\,g(x)\,dx = \int_{\mathbb{R}} f(x)\,(T_t g)(x)\,dx$.

(iii)  $(T_t)$ is a strongly continuous semigroup on $L^q$.

(iv)  If $\lim_{|x| \to \infty} V_1(x) = +\infty$, then $T_t$ is a compact operator on $L^q$.

Let $-H$ denote the infinitesimal generator of $T_t$. In quantum physics we often need to show that $H$ is the self-adjoint extension of $-\frac{1}{2}\Delta + V$, the imaginary-time Schrodinger operator. If $V \ge 0$ this is well known. For the class of potentials given by (H1), if $V \in L^2_{loc}$ then

Proposition 2.2. (Carmona)
$C_c^\infty(\mathbb{R}) \subset D(H)$ and for $f \in C_c^\infty(\mathbb{R})$, $Hf = -\frac{1}{2}\Delta f + Vf$, where $-H$ is the infinitesimal generator of the semigroup $T_t$ defined on $L^2$.


In mathematical physics perturbed Hamiltonians are usually defined as sums of quadratic forms. We briefly review this.

Let $\mathcal{H}$ be a Hilbert space. A quadratic form is a map $q : Q(q) \times Q(q) \to \mathbb{C}$, where $Q(q)$ is a dense linear subset of $\mathcal{H}$, called the form domain, such that $q(\cdot,\psi)$ is conjugate linear and $q(\phi,\cdot)$ is linear for $\phi, \psi \in Q(q)$. If $q(\phi,\psi) = \overline{q(\psi,\phi)}$ we say that $q$ is symmetric. If $q(\phi,\phi) \ge 0$ $\forall \phi \in Q(q)$, $q$ is called positive, and if $q(\phi,\phi) \ge -M\|\phi\|^2$ for some $M$ we say $q$ is semibounded.

Let $A$ be a self-adjoint operator on $\mathcal{H}$. By passing to the spectral representation of $A$, $A$ is multiplication by $x$ on $\bigoplus_{n=1}^N L^2(\mathbb{R}, \mu_n)$. Let

$Q(q) = \left\{ \{\psi_n(x)\}_{n=1}^N \ \Big|\ \sum_{n=1}^N \int_{-\infty}^{\infty} |x|\,|\psi_n(x)|^2\,d\mu_n < \infty \right\}$

and for all $\phi, \psi \in Q(q)$ define

$q(\phi,\psi) = \sum_{n=1}^N \int_{-\infty}^{\infty} x\,\overline{\phi_n(x)}\,\psi_n(x)\,d\mu_n$.

$q$ is called the quadratic form associated with $A$ and, writing $Q(A) = Q(q)$, $Q(A)$ is called the form domain of $A$.

Let $q$ be a semibounded quadratic form, $q(\psi,\psi) \ge -M\|\psi\|^2$. $q$ is called closed if $Q(q)$ is complete under the norm

$\|\psi\|_{+1} = \left(q(\psi,\psi) + (M+1)\|\psi\|^2\right)^{1/2}$.

If $q$ is closed and $D \subset Q(q)$ is dense in $Q(q)$ in the $\|\cdot\|_{+1}$ norm then $D$ is called a form core for $q$. The following fact is important.

Theorem: If $q$ is a closed semibounded quadratic form, then $q$ is the quadratic form of a unique self-adjoint operator.

We now define Schrodinger operators as form sums on $L^2(\mathbb{R})$. For $f, g \in C_c^\infty(\mathbb{R})$ define

$\epsilon_0(f,g) = \frac{1}{2}\int_{\mathbb{R}} \nabla f(x)\,\nabla g(x)\,dx$.

Integrating by parts, $\epsilon_0(f,g) = (-\frac{1}{2}\Delta f, g)_{L^2}$, and hence $\epsilon_0$ is closable. The form domain $Q(\epsilon_0)$ of $\epsilon_0$ is $H^1(\mathbb{R})$. Let $V$ be a real-valued measurable function on $\mathbb{R}$ and set

$Q(V) = \left\{ f \in L^2 \ \Big|\ \int_{\mathbb{R}} |V(x)|\,|f(x)|^2\,dx < +\infty \right\}$.

On $Q(\epsilon_0) \cap Q(V)$ define

$\epsilon(f,g) = \epsilon_0(f,g) + (Vf,g)$, where $(Vf,g) = \left((\mathrm{sgn}\,V)\,|V|^{1/2} f,\ |V|^{1/2} g\right)$.

If we assume $V = V_1 - V_2$ s.t. $V_1$ is bounded below and $V_1 \in L^1_{loc}$, $V_2 \in L^1$, then $\epsilon$ is the form of a unique bounded-below self-adjoint operator $H$ on $L^2$ with form domain $Q(H) = Q(\epsilon_0) \cap Q(V)$.

Let $\mu$ be a Borel probability measure on $\mathbb{R}$ which satisfies

(2.10)  $d\mu(x) = e^{-2h(x)}\,dx$,

where $h$ is a real-valued, locally bounded, absolutely continuous function with first order partial derivatives in $L^2_{loc}$. For all $f, g \in C_c^\infty(\mathbb{R})$, define

(2.11)  $\delta(f,g) = \frac{1}{2}\int_{\mathbb{R}} \nabla f(x)\,\nabla g(x)\,d\mu(x)$.

Integrating by parts the right hand side, we get

(2.12)  $\delta(f,g) = (Df,g)_\mu$, where $(\cdot,\cdot)_\mu$ denotes the $L^2(\mu)$-inner product and

(2.13)  $Df = -\frac{1}{2}\Delta f + \nabla h\,\nabla f$.

From (2.12) $\delta$ is given by a symmetric operator and hence is closable. Let $\bar{\delta}$ denote the closure. $\bar{\delta}$ is referred to as the Dirichlet form of $\mu$ and $D$ as the associated Dirichlet operator.
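The integration by parts linking (2.11) to (2.12)-(2.13) can be checked numerically. The sketch below is illustrative only: derivatives are taken by central differences, integrals by the trapezoidal rule on a truncated interval, and all function names are introduced here. The test functions need not have compact support as long as the weight $e^{-2h}$ makes the boundary terms negligible on the chosen interval.

```python
import math


def integrate(func, a, b, n=20000):
    """Composite trapezoidal rule on [a, b]."""
    step = (b - a) / n
    total = 0.5 * (func(a) + func(b))
    for i in range(1, n):
        total += func(a + i * step)
    return total * step


def derivative(func, x, eps=1e-4):
    """Central-difference first derivative."""
    return (func(x + eps) - func(x - eps)) / (2.0 * eps)


def second_derivative(func, x, eps=1e-4):
    """Central-difference second derivative."""
    return (func(x + eps) - 2.0 * func(x) + func(x - eps)) / (eps * eps)


def dirichlet_form(f, g, h, a=-8.0, b=8.0):
    """delta(f, g) = 1/2 * int f'(x) g'(x) exp(-2 h(x)) dx, as in (2.11)."""
    return 0.5 * integrate(
        lambda x: derivative(f, x) * derivative(g, x) * math.exp(-2.0 * h(x)), a, b)


def operator_pairing(f, g, h, a=-8.0, b=8.0):
    """(Df, g)_mu with Df = -1/2 f'' + h' f', as in (2.12)-(2.13)."""
    def integrand(x):
        Df = -0.5 * second_derivative(f, x) + derivative(h, x) * derivative(f, x)
        return Df * g(x) * math.exp(-2.0 * h(x))
    return integrate(integrand, a, b)
```

For example, with $h(x) = \frac{1}{2}x^2 + \frac{1}{4}\log\pi$ (so that $\mu$ is the Gaussian probability measure with density $\pi^{-1/2}e^{-x^2}$) and $f = g = \sin$, both routines should return approximately $\frac{1}{4}(1 + e^{-1})$.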

We can prove that

Proposition 2.3. (Carmona) The form domain $Q(\bar{\delta})$ is $H^1(\mu)$.

We show the relationship between Dirichlet forms and quadratic forms associated with Schrodinger operators. Let the potential $V$ satisfy hypothesis (H1). We assume that $\inf \mathrm{spec}(H) = E$ is an eigenvalue and let $\psi$ be the corresponding ground state eigenfunction.

Let $h = -\log\psi$ and define the Borel probability measure $\mu$ on $\mathbb{R}$ by

$d\mu(x) = e^{-2h(x)}\,dx$.

Define the operator $\tilde{D}$ by

(2.14)  $\tilde{D} = C(H - E)C^{-1}$,

where $C$ is the unitary operator from $L^2(\mathbb{R},dx) \to L^2(\mu)$ defined by

(2.15)  $C\phi = \psi^{-1}\phi$,  $\phi \in L^2(\mathbb{R},dx)$.

$\tilde{D}$ is a positive self-adjoint operator in $L^2(\mu)$, 0 is a simple eigenvalue and the constant function 1 is the corresponding eigenfunction. In fact $\tilde{D}$ is the unique positive self-adjoint operator associated to the closed positive bilinear form $\tilde{\delta}$ corresponding to $\epsilon - E$ (in the unitary equivalence $C$) and $C_c^\infty(\mathbb{R})$ is a core for $\tilde{\delta}$.

Since $\psi$ is bounded and locally bounded away from 0, $h$ is bounded below and locally bounded above. Since $\psi \in Q(\epsilon)$ and the first order partial derivatives of $h$ are in $L^2_{loc}$, we can associate with $\mu$ a Dirichlet form $\bar{\delta}$ and a Dirichlet operator $D$. Now since $\bar{\delta}$ is defined as the closure of a form whose domain is $C_c^\infty(\mathbb{R})$ and $C_c^\infty(\mathbb{R})$ is a form core for $\tilde{\delta}$, one can show that $\tilde{\delta} = \bar{\delta}$ and $\tilde{D} = D$, and hence the Dirichlet form and Dirichlet operator are unitarily equivalent to the Schrodinger form and Schrodinger operator.

2.4  Stochastic  Processes  Associated  with  Dirichlet  Forms  and  Dirichlet  Operators.

The unitary equivalence between Schrodinger and Dirichlet operators exhibited in the previous section has an important role in Nelson's stochastic mechanics and also in non-linear filtering theory.

We consider the Schrodinger operator $H = -\frac{1}{2}\Delta + V$ defined as a sum of quadratic forms on $L^2(\mathbb{R},dx)$. We assume that $\inf \mathrm{spectrum}(H) = E$ (assumed to be 0) is an eigenvalue and that the corresponding eigenfunction satisfies $\psi \ge 0$ and $\int \psi(x)^2\,dx = 1$. Define the Borel probability measure $\mu$ by $d\mu(x) = (\psi(x))^2\,dx$. Let $\delta$ be the corresponding Dirichlet form and $D$ the corresponding Dirichlet operator. Let $h = -\log\psi$.

We want to construct a Markov diffusion process which corresponds to a stochastic differential equation with drift $-\nabla h$. There is an obvious difficulty in interpreting the stochastic differential equation in a strong sense. But we can construct a weak solution using the measure transformation technique of Girsanov and the Feynman-Kac formula.

To do this we assume the ground state $\psi(x) > 0$, $\forall x$ (by choosing a representative from an equivalence class) and we also assume that $\psi(x) \to 0$ as $|x| \to \infty$.

Now we use the unitary equivalence between the Dirichlet operator $D$ and the operator $H = -\frac{1}{2}\frac{d^2}{dx^2} + V(x)$ to conclude that

(2.16)  $(e^{-tD}f)(x) = \psi(x)^{-1} E_{W_x}\left[\exp\left(-\int_0^t V(W_s)\,ds\right)\psi(W_t)\,f(W_t)\right]$,

where $e^{-tD}$ is the semigroup generated by $-D$.

For each $t > 0$, consider

(2.17)  $L_t = \psi(x)^{-1}\psi(W_t)\exp\left(-\int_0^t V(W_s)\,ds\right)$,

which is a random variable, positive $W_x$-a.s., with $E_{W_x}(L_t) = 1$. Hence $(L_t, \mathcal{F}_t, W_x)$ is a martingale, where $\mathcal{F}_t$ denotes the smallest $\sigma$-algebra for which the coordinate functions $W_s$, $0 \le s \le t$, are measurable. Hence for each $x$, $P_x = L_t \cdot W_x$ is a probability measure on $(\Omega, \mathcal{F})$.

Now by explicit calculation, $V(x) = \frac{1}{2}\left(-\Delta h(x) + |\nabla h(x)|^2\right)$ and since $h = -\log\psi$, we get from (2.17)

(2.18)  $L_t = \exp\left[-h(W_t) + h(x) + \frac{1}{2}\int_0^t \Delta h(W_s)\,ds - \frac{1}{2}\int_0^t |\nabla h(W_s)|^2\,ds\right]$.

The idea now is to apply the Ito differential rule to the function $h$. There is a difficulty here because the function $h$ does not have continuous partial derivatives up to order two. But $h$ is a convex function and for these functions the Ito differential rule can be extended to continuous functions with first partial derivatives in $L^2_{loc}(\mathbb{R},dx)$ and second partial derivatives in $L^1_{loc}(\mathbb{R},dx)$. Applying the generalized Ito differential rule to (2.18), we get

$L_t = \exp\left[-\int_0^t \nabla h(W_s)\,dW_s - \frac{1}{2}\int_0^t |\nabla h(W_s)|^2\,ds\right]$.
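The step from (2.18) to the exponential martingale form can be written out as follows. This is a formal sketch that assumes the (generalized) Ito rule applies to $h$ along Brownian paths started at $x$.

```latex
\begin{align*}
h(W_t) - h(x)
  &= \int_0^t \nabla h(W_s)\,dW_s + \tfrac{1}{2}\int_0^t \Delta h(W_s)\,ds, \\
\log L_t
  &= -\bigl(h(W_t) - h(x)\bigr)
     + \tfrac{1}{2}\int_0^t \Delta h(W_s)\,ds
     - \tfrac{1}{2}\int_0^t |\nabla h(W_s)|^2\,ds \\
  &= -\int_0^t \nabla h(W_s)\,dW_s - \tfrac{1}{2}\int_0^t |\nabla h(W_s)|^2\,ds .
\end{align*}
```

As a consistency check on the formula for $V$, take the harmonic oscillator ground state $\psi(x) = \pi^{-1/4}e^{-x^2/2}$, so that $h(x) = \frac{1}{2}x^2 + \frac{1}{4}\log\pi$, $\nabla h = x$ and $\Delta h = 1$. Then $V(x) = \frac{1}{2}(x^2 - 1)$, which is the potential of the operator $L_0$ of Section 2.2 with ground state energy 0.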

Therefore by the Girsanov theorem, the process $B_t = W_t - W_0 + \int_0^t \nabla h(W_s)\,ds$ is an $(\mathcal{F}_t, P_x)$ standard Brownian motion and, calling $W_t = X_t$, the stochastic process $(X_t \mid t \ge 0)$ considered on the probability space $(\Omega, \mathcal{F}, P_x)$ is a unique weak solution of

(2.19)  $dX_t = -\nabla h(X_t)\,dt + dB_t$,  $X_0 = x$ a.s.

Furthermore the measure $\mu$ defined by $d\mu(x) = \psi(x)^2\,dx$ is the unique finite invariant measure of (2.19).

From the construction of the probability measure $P_x$, we see that it has a transition density

(2.20)  $\tilde{p}(t,y;0,x) = \psi(y)\,\psi^{-1}(x)\,E_{W_x}\left[\exp\left(-\int_0^t V(W_s)\,ds\right) \Big|\ W_t = y\right] p(t,y;0,x)$,

where $p$ is the transition density of Brownian motion.
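The ground-state diffusion (2.19) and its invariant measure can be illustrated by simulation. The sketch below assumes the harmonic oscillator case, $\psi(x) = \pi^{-1/4}e^{-x^2/2}$, for which $\nabla h(x) = x$ and $\psi^2\,dx$ is the centred Gaussian law with variance $\frac{1}{2}$. The Euler-Maruyama scheme, the function names and the sample sizes are illustrative choices and not part of the paper.

```python
import math
import random


def simulate_ground_state_diffusion(grad_h, x0, T, n_steps, rng):
    """Euler-Maruyama for dX = -grad_h(X) dt + dB with X_0 = x0; returns X_T."""
    dt = T / n_steps
    x = x0
    for _ in range(n_steps):
        x += -grad_h(x) * dt + rng.gauss(0.0, math.sqrt(dt))
    return x


def sample_variance(values):
    """Unbiased sample variance."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)
```

Running many independent paths up to a moderately large time and computing `sample_variance` of the end points should give a value close to $\frac{1}{2}$, up to sampling and discretisation error.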

We remark that it is this transition density we wish to compute since this corresponds to the fundamental solution of a parabolic partial differential equation. We shall see later the importance of (2.20) for non-linear filtering problems. For non-linear filtering problems the decay properties of this transition density are also of importance.

We investigate these matters now.

Firstly, we can check that for $\forall t > 0$, $\tilde{p}(t,y;0,x)$ is a continuous function of the pair $(x,y)$. In fact the following estimate holds:

(2.21)  $\exists$ constants $c_1$, $c_2$ such that $\forall t > 0$, $\forall (x,y) \in \mathbb{R} \times \mathbb{R}$,

$\tilde{p}(t,y;0,x) \le c_1 e^{c_2 t}\,\psi(y)\,\psi^{-1}(x)\,p(t,y;0,x)$.

If the semigroup

$(T_t f)(x) = E_{W_x}\left[f(W_t)\exp\left(-\int_0^t V(W_s)\,ds\right)\right]$

is compact (see Theorem 2.1) and its spectrum is strictly bounded away from zero, then the density $\tilde{p}(t,y;0,x)$ satisfies:

(2.22)  $\exists$ positive constants $c_1$, $c_2$ such that

$\sup_{y \in \mathbb{R}} \left|\tilde{p}(t,y;0,x) - \psi(y)^2\right| \le c_1\,\psi^{-1}(x)\,e^{-c_2 t}$,  $\forall t > 0$,  $\forall x \in \mathbb{R}$.

2.5  Hypercontractive  Semigroups 

In the previous sections we have seen that for a large class of potentials,
the negative of the infinitesimal generator of the semigroup $T_t^V$ coincides with a
self-adjoint extension of $-\tfrac{1}{2}\frac{\partial^2}{\partial x^2} + V(x)$. In this section we point out that the
semigroups defined by the Feynman-Kac formula are often hypercontractive semigroups.
These semigroups have played an important role in constructive quantum field theory
and are likely to play an equally important role in the theory of non-linear filtering.

We  follow  the  notation,  hypotheses  (specially  on  the  potential  V)  and  the 
terminology  of  the  previous  sections. 

We consider the Dirichlet semigroup $(e^{-tD}, t > 0)$ on $L^2(\mu)$ where $d\mu(x) = \psi(x)^2\,dx$.

Definition: The semigroup $(e^{-tD}, t > 0)$ is said to be hypercontractive if for some
$t > 0$ and some $r > 2$, $e^{-tD}$ is a bounded operator from $L^2(\mu)$ into $L^r(\mu)$.

From our point of view the best approach towards the question of hypercontractivity
is via the approach of Gross using Logarithmic Sobolev inequalities.

Definition: The operator $D$ is called a Sobolev generator if for some real constants
$c > 0$ and $\gamma$ we have

(2.23)  $\int_{\mathbb{R}^n} |f|^2 \log|f|\,d\mu \le c\,(Df,f)_{L^2(\mu)} + \gamma\,\|f\|^2_{L^2(\mu)} + \|f\|^2_{L^2(\mu)} \log\|f\|_{L^2(\mu)}$

for all $f \in \mathcal{D}(D)$. The constants $c$ and $\gamma$ are called the Sobolev coefficient and
the local norm of $D$.

Logarithmic Sobolev Inequality (Gross)

If $\mu$ is a probability measure on $\mathbb{R}^n$ which satisfies:

(2.24)  $\int_{\mathbb{R}^n} |f|^2 \log|f|\,d\mu \le c\int_{\mathbb{R}^n} |\nabla f|^2\,d\mu + \gamma\,\|f\|^2_{L^2(\mu)} + \|f\|^2_{L^2(\mu)} \log\|f\|_{L^2(\mu)}$

for some constants $c > 0$, $\gamma > 0$ and for all $f$ which are bounded functions with distributional
first order derivatives in $L^2(\mu)$, then for all $r \in [2, +\infty[$

(2.25)  $\int_{\mathbb{R}^n} |f|^r \log|f|\,d\mu \le c(r)\int_{\mathbb{R}^n} \nabla f \cdot \nabla f_r\,d\mu + \gamma\,\|f\|^r_{L^r(\mu)} + \|f\|^r_{L^r(\mu)} \log\|f\|_{L^r(\mu)}$

for all $f$ as above, where $f_r = (\operatorname{sgn} f)\,|f|^{r-1}$ and $c(r) = \frac{c\,r}{2(r-1)}$.
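As a sanity check on the form of (2.24), one can test it numerically for the standard Gaussian measure on $\mathbb{R}$, for which Gross's inequality holds with $c = 1$ and $\gamma = 0$. The sketch below is only an illustration: the quadrature rule, the truncation of the real line and the test functions are our own assumptions.

```python
import math

def gaussian_expectation(func, half_width=10.0, n=20001):
    """Trapezoidal approximation of E[func(X)] for X ~ N(0, 1)."""
    step = 2.0 * half_width / (n - 1)
    total = 0.0
    for i in range(n):
        x = -half_width + i * step
        weight = 0.5 if i in (0, n - 1) else 1.0
        total += weight * func(x) * math.exp(-0.5 * x * x)
    return total * step / math.sqrt(2.0 * math.pi)

def log_sobolev_sides(f, df):
    """Return (lhs, rhs) of  int f^2 log|f| dmu <= int |f'|^2 dmu + ||f||^2 log||f||.

    f must not vanish on the integration range; df is its derivative.
    """
    lhs = gaussian_expectation(lambda x: f(x) ** 2 * math.log(abs(f(x))))
    norm_sq = gaussian_expectation(lambda x: f(x) ** 2)
    energy = gaussian_expectation(lambda x: df(x) ** 2)
    rhs = energy + norm_sq * math.log(math.sqrt(norm_sq))
    return lhs, rhs
```

For $f(x) = 1 + x^2$ the left side is strictly smaller than the right side, while for exponentials $f(x) = e^{ax}$ the two sides agree, which is the known case of equality.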

The  salient  facts  about  hypercontractive  semigroups  and  Sobolev  generators 

are  the  following  (Carmona) 

Proposition 2.4. $(e^{-tD} \mid t > 0)$ is a hypercontractive semigroup if and only if $D$ is
a Sobolev generator.

Proposition 2.5. $D$ is a Sobolev generator with Sobolev coefficient $c$ if and only if
$-\log\psi \le cD + b$, the above interpreted as quadratic forms on $L^2(\mu)$, for some
constant $b$.


Consider potentials satisfying (H1) which have the further property:

$\forall x \in \mathbb{R}$,  $a_1 P(x) + b_1 \le V(x) \le a_2 P(x) + b_2$

where $a_1, a_2 > 0$, $b_1, b_2 \in \mathbb{R}$, $P$ is an even polynomial, and $\liminf_{|x| \to \infty} |x|^{-2} P(x) > 0$.

Proposition  2.6.  Schrodinger  operators  with  potentials  of  the  above  class  generate 
Dirichlet  semigroups  which  are  hypercontractive. 

Example:

(i) The operator $D = -\tfrac{1}{2}\frac{d^2}{dx^2} + x\frac{d}{dx}$ generates a hypercontractive semigroup
$e^{-tD}$ on $L^2(\mathbb{R}, \pi^{-1/2} e^{-x^2}\,dx)$.

(ii) Consider a stochastic differential equation:

$dx_t = f(x_t)\,dt + dw_t$

Suppose that the Riccati equation

$\frac{df}{dx} + f^2 = V(x)$

has a global solution, where $V(x)$ is an even polynomial satisfying the hypotheses
of this section. Then
the generator of the associated diffusion process is in fact a Dirichlet operator
and generates a hypercontractive semigroup on an appropriate $L^2(\mu)$-space.
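For Example (i) the spectrum can be made explicit: the (physicists') Hermite polynomials $H_n$ satisfy $D H_n = n H_n$, so $e^{-tD} H_n = e^{-nt} H_n$. The following sketch checks the eigenvalue relation with exact polynomial arithmetic. The list-of-coefficients representation and the helper names are our own illustrative choices.

```python
from fractions import Fraction

def trim(p):
    """Drop trailing zero coefficients, keeping at least one entry."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p

def derivative(p):
    """Derivative of c0 + c1 x + c2 x^2 + ... given as [c0, c1, c2, ...]."""
    return trim([k * c for k, c in enumerate(p)][1:] or [0])

def hermite(n):
    """Physicists' Hermite polynomial H_n from H_{k+1} = 2x H_k - 2k H_{k-1}."""
    prev, cur = [1], [0, 2]
    if n == 0:
        return prev
    for k in range(1, n):
        two_x_cur = [0] + [2 * c for c in cur]
        padded_prev = prev + [0] * (len(two_x_cur) - len(prev))
        prev, cur = cur, [a - 2 * k * b for a, b in zip(two_x_cur, padded_prev)]
    return cur

def apply_D(p):
    """Apply D = -(1/2) d^2/dx^2 + x d/dx to a polynomial."""
    first = derivative(p)
    second = derivative(first)
    x_first = [0] + first
    padded_second = second + [0] * (len(x_first) - len(second))
    return trim([a - Fraction(1, 2) * b for a, b in zip(x_first, padded_second)])
```

For instance `apply_D(hermite(3))` returns the coefficients of $3 H_3$.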


Notes  and  References  for  Section  2 


(i)  For  general  references  for  this  section,  consult 

1.  B.  Simon:  Functional  Integration  and  Quantum  Physics,  Academic  Press, 

New  York,  1979. 

2.  M.  Reed  and  B.  Simon:  Methods  of  Modern  Mathematical  Physics,  Vols.  I 
and  II,  Academic  Press,  New  York,  1972,  1975. 

3.  D.W.  Stroock  and  S.R.S.  Varadhan:  Multi-dimensional  Diffusion  Processes, 
Springer-Verlag,  Berlin,  New  York,  1979. 

(ii)  In  this  section  we  follow  very  closely: 

1.  R. Carmona:  Regularity  Properties  of  Schrodinger  and  Dirichlet  Semigroups,
J.  of  Functional  Analysis  33,  259-296  (1979).

(iii)  The  material  in  Section  2.4  is  apparently  partially  new  and  uses  the  Girsanov
transformation  to  obtain  a  weak  solution  of  a  stochastic  differential  equation.
For  a  slightly  different  approach  see:

1.  H.  Ezawa,  J.R.  Klauder  and  L.  Shepp:  A  Path  Space  Picture  for  Feynman-Kac
Averages,  Annals  of  Physics  88,  588-620  (1974).

(iv)  For  the  generalized  Ito  differential  rule,  see  A.  Bensoussan  and  J.L.  Lions:
Applications  des  Inéquations  Variationnelles  en  Contrôle  Stochastique,
Dunod,  Paris,  1978.

(v)  The  reference  to  stochastic  mechanics  is:  E.  Nelson:  Dynamical  Theories  of  Brownian
Motion,  Princeton  University  Press,  Princeton,  N.J.,  1967.

3. The Bayes Formula and the Unnormalized Conditional Density Equation

Let $(\Omega, \mathcal{F}, P)$ be a complete probability space, $(\mathcal{F}_t)$, $0 \le t \le 1$, a non-decreasing
family of sub σ-algebras of $\mathcal{F}$ and $(\eta_t, \mathcal{F}_t)$, $0 \le t \le 1$, a Wiener process.
Consider the signal process $z = (z_t, \mathcal{F}_t)$ and the observation equation

(3.1)  $y_t = \int_0^t z_s\,ds + \eta_t$.

Let

(3.2)  $z_t = h(x_t)$, and

$x_t$ be the solution of the Ito stochastic differential equation

(3.3)  $dx_t = f(x_t)\,dt + g(x_t)\,dw_t$.

We make the following assumptions:

A1. The functions $f$ and $g$ are continuous and bounded and equation (3.3) has a
unique solution in the weak sense for each initial condition $x_0$ (later we
shall have to make further smoothness assumptions).

A2. $x_t$ and $\eta_t$ are independent and $E\left(\int_0^1 h(x_s)^2\,ds\right) < \infty$.

Under the above hypotheses we can write down a Bayes formula for computing
$E(\phi(x_t) \mid \mathcal{F}_t^y)$, where $\mathcal{F}_t^y$ is the σ-field generated by $\{y_s \mid 0 \le s \le t\}$ and
$E\left(\int_0^1 \phi(x_s)^2\,ds\right) < \infty$. This relies on the Girsanov transformation. It is known that
there is a new equivalent measure $\tilde{P}$ under which $y_t$ and $x_t$ are independent, the
distribution of $x_t$ remains invariant and under this new measure $y$ is Brownian motion.
The Radon-Nikodym derivative of $P$ with respect to $\tilde{P}$ is given by

$\frac{dP}{d\tilde{P}} = L_t = \exp\left(\int_0^t h(x_s)\,dy_s - \tfrac{1}{2}\int_0^t h^2(x_s)\,ds\right)$.

Hence

$E(\phi(x_t) \mid \mathcal{F}_t^y) = \frac{\tilde{E}(L_t\,\phi(x_t) \mid \mathcal{F}_t^y)}{\tilde{E}(L_t \mid \mathcal{F}_t^y)}$.

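Using the independence, it is convenient to view this conditional expectation as
expectation on a product space $(\Omega \times \tilde{\Omega}, \mathcal{A} \times \tilde{\mathcal{A}}, P \otimes \tilde{P})$ and write it explicitly as

(3.4)  $\widehat{\phi(x_t)} = \dfrac{\int_\Omega \phi(x_t(\omega)) \exp\left(\int_0^t h(x_s(\omega))\,dy_s(\tilde{\omega}) - \tfrac{1}{2}\int_0^t h(x_s(\omega))^2\,ds\right) dP(\omega)}{\int_\Omega \exp\left(\int_0^t h(x_s(\omega))\,dy_s(\tilde{\omega}) - \tfrac{1}{2}\int_0^t h(x_s(\omega))^2\,ds\right) dP(\omega)}$

This is the Bayes Formula.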
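The Bayes formula suggests a direct numerical recipe: draw sample paths of the signal, weight each path by the exponential functional, and take the weighted average. The sketch below is a minimal illustration and not a method taken from the text. It assumes equally likely sample paths on a uniform time grid and replaces the stochastic integral by a left-point Ito sum; all names are our own.

```python
import math

def bayes_formula_estimate(signal_paths, dy, h, phi, dt):
    """Discretized ratio of integrals with equally likely signal sample paths.

    signal_paths: list of paths, each a list x[0..n] on a grid of step dt
    dy:           observation increments y[k+1] - y[k], k = 0..n-1
    Returns the weighted average of phi(x[n]).
    """
    log_weights = []
    for path in signal_paths:
        log_w = 0.0
        for k, inc in enumerate(dy):
            hk = h(path[k])
            log_w += hk * inc - 0.5 * hk * hk * dt  # Ito sum and its compensator
        log_weights.append(log_w)
    shift = max(log_weights)  # subtract the maximum to avoid overflow in exp
    weights = [math.exp(lw - shift) for lw in log_weights]
    numerator = sum(w * phi(path[len(dy)]) for w, path in zip(weights, signal_paths))
    return numerator / sum(weights)
```

As a check, if the signal is a constant equal to $+1$ or $-1$ with probability 1/2 each and $h(x) = x$, the weights reduce to $e^{\pm y_t - t/2}$ and the estimate of the signal is $\tanh(y_t)$.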

If we replace $y_s$ by $x_s$ for $x \in C(0,1)$ and define

(3.5)  $\rho(t,x,\omega) = \exp\left(\int_0^t h(x_s(\omega))\,dx_s - \tfrac{1}{2}\int_0^t h(x_s(\omega))^2\,ds\right)$ and

(3.6)  $g(t,x) = \int_\Omega \rho(t,x,\omega)\,dP(\omega)$,

then $g(t,x)$ is the Radon-Nikodym derivative $\frac{d\mu_y}{d\mu_w}(t,x)$, where $\mu_y$ is the measure
induced on $C(0,1)$ by the observation and $\mu_w$ is the Wiener measure on $C(0,1)$
induced by $\eta$.

Let $\sigma_t(\phi)$ denote the numerator of (3.4). It will be also convenient to write
the numerator of (3.4) as a Feynman-Kac formula

(3.7)  $\sigma_t(\phi) = \int_X \phi(\xi_t)\exp\left(\int_0^t h(\xi_s)\,dy_s - \tfrac{1}{2}\int_0^t h^2(\xi_s)\,ds\right) dP_x(\xi)$,

where $X = C(0,1)$.

Let $(T_t)$ be the semigroup of the diffusion process corresponding to (3.3) and
$L_0 = \tfrac{1}{2} g^2(x)\frac{\partial^2}{\partial x^2} + f(x)\frac{\partial}{\partial x}$ be its infinitesimal generator. Then $\sigma_t$ satisfies the
stochastic partial differential equation

(3.8)  $d\sigma_t(\phi) = \sigma_t(L_0\phi)\,dt + \sigma_t(L_1\phi)\,dy_t$, for all $\phi \in \mathcal{D}(L_0) \cap \mathcal{D}(L_1)$, where
$L_1$ is the unbounded multiplication operator $h$.

This is one of the fundamental results of non-linear filtering. Now using
the fact that $h(x_s)$ is a semi-martingale and using the relationship between Ito
and Fisk-Stratonovich integrals we get

$\sigma_s(h\phi)\,dy_s = \sigma_s(h\phi)\circ dy_s - \tfrac{1}{2}\,d\langle \sigma(h\phi), y\rangle_s$,

where $\langle\ ,\ \rangle$ denotes the quadratic variation and $\circ$ denotes the Fisk-Stratonovich
integral, and hence we obtain

(3.9)  $d\sigma_t(\phi) = \sigma_t(L_0\phi)\,dt - \tfrac{1}{2}\sigma_t(h^2\phi)\,dt + \sigma_t(h\phi)\circ dy_t$,

which we write symbolically as

(3.10)  $\frac{d\sigma_t(\phi)}{dt} = \sigma_t\left[(L_0 + \dot{y}_t h - \tfrac{1}{2}h^2)(\phi)\right]$,

which we can integrate by the Feynman-Kac formula

(3.11)  $\sigma_t(\phi) = E_{\sigma_0}\left[\exp\left(\int_0^t \dot{y}_s h(x_s)\,ds - \tfrac{1}{2}\int_0^t h^2(x_s)\,ds\right)\phi(x_t)\right]$

where $\sigma_0$ is a measure-valued random variable independent of $y$ such that
$E[\sigma_0(\phi)] = \mu(\phi)$ and $\mu$ is the initial distribution.

Define

$\Lambda(t, z, y_0^t) = \tilde{E}\left[L_t \mid \sigma\{x_t\} \times \mathcal{F}_t^y\right]$

and define

$q(t, z, y_0^t) = \Lambda(t, z, y_0^t)\,p(t,z)$

where $p(t,z)$ is the density of the x-process. Then

$\sigma_t(\phi) = \int_{\mathbb{R}} \phi(z)\,q(t, z, y_0^t)\,dz$.

Now $\sigma_t(\phi) = \int \phi\,d\sigma_t$, and hence we see that $\sigma_t$ has a density $q(t, z, y_0^t)$.
Hence from (3.7)

(3.12)  $q(t, z, y_0^t) = \int_X \exp\left(\int_0^t h(x_s)\,dy_s - \tfrac{1}{2}\int_0^t h^2(x_s)\,ds\right) d\mu_z(x)$

where the integration is over the path space of $x$ with $x_0 = 0$ and $x_t = z$. It
can be shown that $q_t = q(t, z, y_0^t)$ satisfies the stochastic partial differential
equation

(3.13)  $dq_t = L_0^* q_t\,dt + L_1 q_t\,dy_t$,

where $L_0^*$ is the formal adjoint of $L_0$.

Equation (3.13) is the Mortensen-Zakai equation for the unnormalized
conditional density.


3.1  Mortensen-Zakai  Equations  and  Gauge  Transformations 

The study of non-linear filtering is the study of the Mortensen-Zakai equation:
its explicit solution and its group invariance properties. Equivalently it
is the study of the function space representation of $\sigma_t(\phi)$ given in (3.7) and (3.12).
In (3.7) we have the stochastic integral $\int_0^t h(\xi_s)\,dy_s$. Under our hypotheses we can
write

$\int_0^t h(\xi_s)\,dy_s = y_t\,h(\xi_t) - y_0\,h(\xi_0) - \int_0^t y_s\,dh(\xi_s)$.

Using the above it is clear that $\sigma_t$ can be evaluated for all $y \in C(0,1)$ (not just
a subset of full Wiener measure). It has been shown by Clark that $\frac{\sigma_t(\phi,y)}{\sigma_t(1,y)}$ is a
version of the conditional expectation $\widehat{\phi(x_t)}$.
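The integration by parts above has an exact discrete counterpart (Abel summation), which shows why the right-hand side makes sense for every continuous observation path: no increments of $y$ appear in it. The sketch below is only an illustration with names of our own choosing. Note that the discrete correction term samples $y$ at the right endpoint of each interval; in the limit this differs from the left-endpoint sum by the covariation of $y$ and $h(\xi)$, which is assumed here to vanish (as it does when the two processes are independent).

```python
def ito_sum(h_vals, y_vals):
    """Left-point sum  sum_k h_k (y_{k+1} - y_k)."""
    return sum(h_vals[k] * (y_vals[k + 1] - y_vals[k])
               for k in range(len(y_vals) - 1))

def by_parts_sum(h_vals, y_vals):
    """y_n h_n - y_0 h_0 - sum_k y_{k+1} (h_{k+1} - h_k)."""
    n = len(y_vals) - 1
    boundary = y_vals[n] * h_vals[n] - y_vals[0] * h_vals[0]
    correction = sum(y_vals[k + 1] * (h_vals[k + 1] - h_vals[k]) for k in range(n))
    return boundary - correction
```

Both functions return the same number for any pair of sequences of equal length.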


An equivalent way of considering this problem is to eliminate the stochastic
integral from the Mortensen-Zakai equation (3.13).

This can be done by making the observation that the operator $L_1$ = multiplication
by $h$ is a diagonal operator, and its effect can be removed via a time-dependent
gauge transformation. To see this write $q_t = \exp(h(x)y_t)\,\tilde{q}_t$.
Then a direct calculation shows that $\tilde{q}_t$ satisfies an ordinary partial differential
equation (parametrized by $y$):

(3.14)  $\frac{d\tilde{q}_t}{dt} = \exp(-h(x)y_t)\,(L_0^* - \tfrac{1}{2}L_1^2)\,(\exp(h(x)y_t)\,\tilde{q}_t)$

It is interesting to rewrite equation (3.14) in a form which brings out the
commutation properties of $L_0^* - \tfrac{1}{2}L_1^2$ and $L_1$. If we denote $L_2 = [L_0^* - \tfrac{1}{2}L_1^2, L_1]$
and $L_3 = [L_1, L_2]$, considered as formal differential operators and computed on some
common invariant domain, (3.14) can be rewritten as:

(3.15)  $\frac{d\tilde{q}_t}{dt} = (L_0^* - \tfrac{1}{2}L_1^2)\,\tilde{q}_t + y_t L_2\,\tilde{q}_t - \tfrac{1}{2}y_t^2 L_3\,\tilde{q}_t$

Explicitly $L_2 = h_x\frac{d}{dx} + (\tfrac{1}{2}h_{xx} - f h_x)$, $L_3 = -h_x^2$ (assuming $g = 1$ for simplicity).

When we are dealing with unbounded observation operators $h$ it is this equation
which is the easiest to deal with. The above also shows that the commutators $L_2$
and $L_3$ have an important role to play in the understanding of equation (3.14).
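The explicit commutator formulas can be verified with exact polynomial arithmetic. The sketch below is an illustration under our own assumptions: $g = 1$, polynomial $f$, $h$ and test function, $L_0^* q = \tfrac{1}{2}q'' - (fq)'$, and the conventions $L_2 = [L_0^* - \tfrac{1}{2}L_1^2, L_1]$, $L_3 = [L_1, L_2]$. Polynomials are stored as coefficient lists and all helper names are ours.

```python
from fractions import Fraction

def trim(p):
    """Drop trailing zero coefficients, keeping at least one entry."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p

def add(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
                 for i in range(n)])

def scale(c, p):
    return trim([c * a for a in p])

def mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

def diff(p):
    return trim([k * a for k, a in enumerate(p)][1:] or [0])

def make_operators(f, h):
    """Return A = L0* - (1/2) L1^2 (with g = 1), L1, L2 = [A, L1], L3 = [L1, L2]."""
    half = Fraction(1, 2)
    A = lambda q: add(add(scale(half, diff(diff(q))), scale(-1, diff(mul(f, q)))),
                      scale(-half, mul(mul(h, h), q)))
    L1 = lambda q: mul(h, q)
    L2 = lambda q: add(A(L1(q)), scale(-1, L1(A(q))))
    L3 = lambda q: add(L1(L2(q)), scale(-1, L2(L1(q))))
    return A, L1, L2, L3
```

Applying `L2` and `L3` to a test polynomial reproduces $h_x q' + (\tfrac{1}{2}h_{xx} - f h_x)q$ and $-h_x^2 q$ respectively.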

In (3.12) we see that evaluating $q_t$ involves an integration over the path
space of the x-process. The integration would be simplified if this could be done
with respect to Wiener measure. This corresponds to removing the drift term in
the operator $L_0^*$. For simplicity assume $g = 1$. Define the operator multiplication
by $\psi(x)$, where $\psi$ is invertible from the intertwining relationship.

For this purpose, we consider that the $x_t$-process is governed by

$dx_t = f(x_t)\,dt + dw_t$, and the observation equation is

$dy_t = x_t\,dt + d\eta_t$.

Hence $L_0^* = \tfrac{1}{2}\frac{d^2}{dx^2} - \frac{d}{dx}f$, and $L_1$ = mult. by $x$.

We do a gauge transformation and attempt to solve

$dq_t = L_0 q_t\,dt - V(x)\,q_t\,dt + x\,q_t\,dy_t$, where $V(x) = \tfrac{1}{2}(f_x + f^2)$

(the drift having been removed, $L_0$ here stands for $\tfrac{1}{2}\frac{d^2}{dx^2}$).

Hence we have to compute:

(3.17)  $q(t,z,y_0^t) = E_{\mu_w^z}\left[\exp\left(\int_0^t x_s\,dy_s - \tfrac{1}{2}\int_0^t x_s^2\,ds - \int_0^t V(x_s)\,ds\right)\right]$

where the integration is over Wiener measure with $x_0 = 0$ and $x_t = z$.

We show that if $V(x) = ax^2 + bx + c$, and $f_x + f^2 = V(x)$ has a global solution
then a sufficient statistic for computing $q$ is the set of equations:

(3.18)  $d\xi_s = -\eta_s\,ds$ ;  $\xi_0 = 0$ ,  $\xi_t = z$
        $d\eta_s = -\xi_s\,ds - V_x(\xi_s)\,ds + dy_s$ . (1)

We call the above set of equations the bi-characteristics of the Mortensen-Zakai
equation. To see this we introduce the transformation

$dx_s = d\xi_s + dz_s$, where $\xi_s$ is given by the above and we do not impose
the boundary conditions.

(1) This stochastic two-point boundary value problem has to be given an appropriate
sample-pathwise interpretation.

By essentially the Girsanov Theorem,

$\frac{d\mu_z}{d\mu_w} = \exp\left(\int_0^t \eta_s\,dz_s - \tfrac{1}{2}\int_0^t \eta_s^2\,ds\right)$.

Now impose the boundary conditions (1) and hence in terms of the z-variable

$q(t,z,y_0^t) = \int \exp\left(\int_0^t (z_s + \xi_s)\,dy_s - \tfrac{1}{2}\int_0^t (z_s + \xi_s)^2\,ds - \int_0^t V(z_s + \xi_s)\,ds\right)$
$\qquad\qquad \times \exp\left(\int_0^t \eta_s\,dz_s - \tfrac{1}{2}\int_0^t \eta_s^2\,ds\right) d\mu_z$

where the integration is over the path space of $z$ with $z_0 = 0$ and $z_t = 0$.

By applying Ito's rule several times, and using the fact that $V(x) = ax^2 + 2bx + c$,
we obtain

q(t,z,yj)  =  exp(i  /J  ?gdys  -  /J  b^ds  +  f^)) 

(3.19) 

x  /  exp(-/g(v(zg)  +  |  zg)dy° 

The  terms  involving  the  z-process  can  be  explicitly  evaluated. 

It is worth remarking that (3.18) is the solution of a variational problem.
To see this let us first interpret equation (3.18) in the Stratonovich sense.
Then equations (3.18) are the necessary conditions of the following optimal control
problem:

$\min_u\ \tfrac{1}{2}\int_0^t u_s^2\,ds + \tfrac{1}{2}\int_0^t \left[(\xi_s - \dot{y}_s)^2 + 2V(\xi_s)\right] ds$

$\dot{\xi}_s = u_s$ ;  $\xi_0 = 0$ ,  $\xi_t = z$ . (2)

(1) By conditioning.

(2) This variational problem can be given rigorous meaning using recent
work of Bismut.

If $y = 0$ and $V = 0$, then the equations (3.18) correspond to the imaginary
time harmonic oscillator and our methods show the relation between a euclidean
quantum harmonic oscillator and an imaginary time classical harmonic oscillator.

These methods could also be applied to study the limiting behavior of the
Cauchy problem
M  =  XV2  *  +  as  A  -*■  0 

dt  x  A 

with an initial condition $\psi^\lambda(x,0) = \exp[-S_0(x)/\lambda]$ and show that the limiting
solution satisfies the corresponding equations of classical mechanics. This would
provide an alternative derivation to the results of Maslov who treated the
Schrodinger equation version of this problem.

Equations (3.18) are related to the smoothing problem. It turns out that
the unnormalized conditional density could also be evaluated by the following
sufficient statistic.

(3.20)  $d\alpha_s = (f(\alpha_s) - \beta_s)\,ds$ ;  $\alpha_0 = 0$ ,  $\alpha_t = z$
        $d\beta_s = -(f_\alpha(\alpha_s)\,\beta_s + \alpha_s)\,ds + dy_s$

These are the Euler-Lagrange equations for the following optimal control
problem:

$\min_u\ \tfrac{1}{2}\int_0^t u_s^2\,ds + \tfrac{1}{2}\int_0^t \left\{(\alpha_s - \dot{y}_s)^2 + 2V(\alpha_s)\right\} ds$

$\dot{\alpha}_s = f(\alpha_s) + u_s$ ;  $\alpha_0 = 0$ ,  $\alpha_t = z$ .

For the general Kalman filtering problem

(3.21)  $dx_t = F x_t\,dt + G\,dw_t$
        $dy_t = H x_t\,dt + d\eta_t$

the equations corresponding to (3.20) would be

$d\alpha_t = F\alpha_t\,dt - GG'\beta_t\,dt$ ;  $\alpha_0 = 0$ ,  $\alpha_t = z$
$d\beta_t = -H'H\alpha_t\,dt - F'\beta_t\,dt + H'\,dy_t$

These can be recognized to be the smoothing equations given in Hamiltonian
form. Our methods show that these equations are intrinsically attached to the
Mortensen-Zakai equation and play the role of bi-characteristics corresponding to
Hamilton-Jacobi equations.

Although we do not do it in this paper it seems reasonable to believe that
a perturbation theory analogous to Maslov's work could be carried out for non-linear
filtering using the framework used in this section (see forthcoming work
of Duncan and Mitter).

in Communication Systems and Random Process Theory: ed. J.K. Skwirzynski, Sijthoff

(vi) The reference to Maslov's work is: V.P. Maslov: Théorie des Perturbations et
Méthodes Asymptotiques, Dunod-Gauthier-Villars, Paris, 1972.

4. Multiple Ito Integrals and Fock Space

There  is  a  close  relationship  between  the  theory  of  multiple  Ito  integrals 
and  Wick  polynomials.  These  objects  also  have  an  important  role  to  play  in 
Wiener's  theory  of  homogeneous  chaos  and  representations  of  the  Weyl  Commutation 
relations  on  Fock  space.  These  ideas  and  constructions  also  are  of  importance 
in  non-linear  filtering  theory. 

4.1 Multiple Ito Integrals

The multiple Ito integral of order $K$ is a map $f \mapsto I_K(f)$: $L^2(\mathbb{R}^K) \to L^2(\Omega, \mathcal{F}, P)$
having the following properties:

(i) $I_K(h) = \prod_{i=1}^K W(A_i)$ if $h = 1_{A_1} \times \dots \times 1_{A_K}$ for
disjoint rectangles $A_1, \dots, A_K$.

(ii) $I_K(f+g) = I_K(f) + I_K(g)$.

(iii) If $f_j \to f$ in $L^2(\mathbb{R}^K)$ then $I_K(f_j) \to I_K(f)$ in $L^2(\Omega, \mathcal{F}, P)$.

If $K = 0$ and $h \in \mathbb{R}$, define $I_0(h) = h$.

Let $\tilde{f}(t_1,\dots,t_K) = \frac{1}{K!}\sum_\pi f(t_{\pi(1)},\dots,t_{\pi(K)})$ denote the symmetrization of $f$.
The mapping $f \mapsto \tilde{f}$ is the projection of $L^2(\mathbb{R}^K)$ onto the subspace $\hat{L}^2(\mathbb{R}^K)$ of $L^2(\mathbb{R}^K)$
spanned by the symmetric functions.

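The multiple Ito integral has the following properties:

(a) For $f \in L^2(\mathbb{R}^K)$, $g \in L^2(\mathbb{R}^{K'})$, $I_K(f) = I_K(\tilde{f})$ and

$E(I_K(f)\,I_{K'}(g)) = 1_{\{K = K'\}}\,K!\,(\tilde{f}, \tilde{g})_{L^2}$

(b) For $\phi \in L^2(\mathbb{R})$ and $\lambda \in \mathbb{C}$,

$\exp\left(\lambda\int_0^\infty \phi_s\,dw_s - \tfrac{1}{2}\lambda^2\int_0^\infty \phi_s^2\,ds\right) = \sum_{K=0}^\infty \frac{\lambda^K}{K!}\,I_K(\phi^{\otimes K})$

(c) $L^2(\Omega, \mathcal{F}, P) = \bigoplus_{K=0}^\infty \{I_K(f) \mid f \in \hat{L}^2(\mathbb{R}^K)\} \cong \bigoplus_{K=0}^\infty \hat{L}^2(\mathbb{R}^K)$.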
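Property (b) can be illustrated in the simplest case. If $\phi$ is the indicator of $[0,1]$, it is a classical fact that $I_K(\phi^{\otimes K}) = He_K(w_1)$, the $K$-th Hermite polynomial (probabilists' normalization) evaluated at the Wiener process at time 1, so (b) reduces to the Hermite generating function. The sketch below checks this identity numerically; the truncation order and the names are our own illustrative choices.

```python
import math

def hermite_he(n, x):
    """Probabilists' Hermite polynomial He_n(x): He_{k+1} = x He_k - k He_{k-1}."""
    prev, cur = 1.0, x
    if n == 0:
        return prev
    for k in range(1, n):
        prev, cur = cur, x * cur - k * prev
    return cur

def chaos_series(lam, w, n_terms=40):
    """Partial sum of  sum_K lam^K / K! * He_K(w)."""
    return sum(lam ** k / math.factorial(k) * hermite_he(k, w)
               for k in range(n_terms))

def exponential_functional(lam, w):
    """exp(lam * w - lam^2 / 2): the left side of (b) when phi = 1_[0,1] and w_1 = w."""
    return math.exp(lam * w - 0.5 * lam * lam)
```

For moderate values of $\lambda$ and $w$ the partial sums agree with the exponential to machine precision.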


We now consider two applications of multiple Ito integrals.

4.2 A Representation Theorem for the Best Estimate of a Signal
Consider  the  non-linear  filtering  problem 


and assume that the hypotheses of Section 3 (assumptions A1 and A2) hold. Suppose
that we want a representation for $\hat{z}_t = E(z_t \mid \mathcal{F}_t^y)$. It is a standard result that the
innovations process

$\nu_t = y_t - \int_0^t \hat{z}_s\,ds$

is standard Brownian motion. Recently we have shown that
$\mathcal{F}_t^\nu = \mathcal{F}_t^y$. Hence $\hat{z}_t \in L^2(\Omega, \mathcal{F}_t^\nu, P)$ and hence from property (c)

$\hat{z}_t = \hat{z}_0 + \int_0^t k_1(t,s)\,d\nu_s + \int_0^t\!\int_0^{s_1} k_2(t,s_1,s_2)\,d\nu_{s_2}\,d\nu_{s_1} + \dots$

and a standard application of the definition of conditional expectation as a
projection shows

$k_1(t,s) = \frac{\partial}{\partial s} E(z_t\,\nu_s)$

$k_2(t,s_1,s_2) = \frac{\partial^2}{\partial s_1\,\partial s_2} E(z_t\,\nu_{s_1}\nu_{s_2})$

etc.


This representation is not too useful since the computation of the innovations
requires computing $\hat{z}_t$. However for the following problem it immediately leads to
a finite dimensional filter. Suppose that $z_s = Hx_s$ and $dx_s = Fx_s\,ds + G\,dw_s$. We are
required to estimate $x_s$ and $P(x_s, 0 \le s \le t)$, where $P$ is a polynomial functional of
$x_s$ with separable kernels. Then it is not too difficult to show that the estimate
is also a polynomial functional with separable kernels of the innovations
$d\nu_s = dy_s - H\hat{x}_s\,ds$, where $\hat{x}_s$ is the Kalman filter estimate.

The ideas of multiple Ito integrals also have applications to representing
the density $q(t,z,y_0^t)$ as a multiple integral expansion in the semi-martingale $y$.

4.2 Random Fields

In  quantum  field  theory  random  fields  (weak  distributions)  and  their  poly¬ 
nomial  functions  (suitably  defined)  have  played  an  important  role.  A  Wiener 
integral  or  more  generally  an  Ito  integral  is  an  example  of  a  random  field. 
Multiple  Ito  integrals  can  be  considered  to  be  polynomial  functions  of  random 
fields, provided these polynomial functions are suitably renormalized. These
are the Wick polynomials. Orthogonal polynomials and in particular the Hermite
polynomials have a special role to play in this theory.

4.2.1 Basic Definitions and Notation

Let $H$ be a real Hilbert space, with the scalar product $\langle\cdot,\cdot\rangle$. We
shall identify $H$ with its topological dual. Let $(\Omega, \mathcal{A}, \mu)$ be a probability space.
We denote by $L^0(\Omega, \mathcal{A}, \mu)$ the space of real (complex)-valued random variables on $\Omega$.

Definition 4.1. A continuous (1) linear function

(4.1)  $F : H \to L^0(\Omega, \mathcal{A}, \mu)$

is termed a random field or weak distribution. Two random fields $F_1$ and $F_2$ are
equivalent if for any $\{f_1, \dots, f_n\} \subset H$, the joint distributions of
$F_1(f_1), \dots, F_1(f_n)$ and $F_2(f_1), \dots, F_2(f_n)$ are the same.

An  example  of  a  random  field  which  is  of  importance  to  us  is  one  which  is 
"generated"  by  the  Wiener  process.  Let  W(t,w)  denote  the  Wiener  process  for 
$t > 0$. Define $W(-t,\omega) = W(t,\omega)$, $t \ge 0$, and thus extend the definition to all of
$\mathbb{R}$. Then if we define

(4.2)  $F : L^2(\mathbb{R}) \to L^0(\Omega, \mathcal{A}, \mu)$ by $F(f) = \int f\,dW(t,\omega)$,

where the right hand side is a Wiener integral, then $F$ is a random field.

Definition 4.2. Given a random field $F : H \to L^0(\Omega, \mathcal{A}, \mu)$, the mean functional is
the map

(4.3)  $M_F(f) = E(F(f)) = \int F(f)\,d\mu$.

The mean functional need not exist for all $f \in H$. If it exists, then $M_F$ is a
linear (not necessarily continuous) functional on $H$.

(1) Continuous means if $f_n \to f$ in $H$ then $F(f_n) \to F(f)$ in probability.

The covariance functional $C_F : H \times H \to \mathbb{C}$ is defined by

$C_F(f,g) = E\{[F(f) - M_F(f)]\,\overline{[F(g) - M_F(g)]}\}$

where the bar denotes complex conjugation. If $C_F$ exists for all $f, g \in H$ then $C_F$
is a positive Hermitian bilinear form on $H$. The variance functional $V_F$ is
defined by $V_F(f) = C_F(f,f)$ and the correlation functional by

$B_F(f,g) = E[F(f)\,\overline{F(g)}]$.

The random field is said to be bounded if there exists a $k > 0$ such that
$\sqrt{B_F(f,f)} \le k\,\|f\|$ for all $f \in H$.

The characteristic functional $\Phi_F : H \to \mathbb{C}$ is defined by

(4.5)  $\Phi_F(f) = E[e^{iF(f)}] = \int e^{iF(f)}\,d\mu$.

It is a continuous positive definite functional and $\Phi_F(0) = 1$.

Remark: Bochner's theorem extends to this infinite dimensional situation.

The  following  proposition  is  a  consequence  of  the  Riesz  representation 
theorem. 

Proposition 4.1. Suppose $F$ is a bounded random field. Then there exists a
unique vector $f_F \in H$ (the mean) and unique bounded self-adjoint non-negative
operators $R_F$ (covariance) and $S_F$ (correlation) such that $M_F(f) = \langle f, f_F\rangle$,
$C_F(f,g) = \langle R_F f, g\rangle$ and $B_F(f,g) = \langle S_F f, g\rangle$.

Definition 4.3. A random field $N$ such that $B_N(f,f) < \infty$ with characteristic
function

(4.6)  $\Phi_N(f) = \exp\left[iM_N(f) - \tfrac{1}{2}V_N(f)\right]$

is termed a Gaussian random field.

We shall be interested in isonormal random fields $N$ with mean zero and
variance parameter $c > 0$. Its covariance function is $C_N(f,g) = c\,\langle f,g\rangle$. If
$c = 1$ then we get the unit isonormal random field. Its characteristic function
is $e^{-\|f\|^2/2}$.

Now let us consider a bounded Gaussian random field $N$. Its characteristic
function can be written as:

$\Phi_N(f) = \exp\left[i\langle f, f_N\rangle - \tfrac{1}{2}\langle R_N f, f\rangle\right]$.

In particular, if the bounded Gaussian random field is the isonormal field with
parameter $\sigma > 0$ then $R_N = \sigma I$.

Let $\mathcal{B}(H)$ denote the Borel σ-algebra of $H$ and let $p$ be a probability measure
on $\mathcal{B}(H)$. Define the random field $F_p : H \to L^0(H, \mathcal{B}(H), p)$ by $[F_p(f)](g) = \langle f,g\rangle$.
In this way every Borel probability measure $p$ on $\mathcal{B}(H)$ produces a random field.
Suppose $F : H \to L^0(\Omega, \mathcal{A}, \mu)$ is a random field. Then we say $F$ is generated by $p$ on
$\mathcal{B}(H)$ if $F$ is equivalent to $F_p$. We then have

Proposition 4.2. A Gaussian random field $N$ is generated by a probability measure
on $(H, \mathcal{B}(H))$ if and only if the covariance operator $R_N$ is trace class.

Thus if $\dim H = \infty$, the isonormal random field with covariance operator $\sigma I$, $\sigma > 0$,
could never be produced by a countably-additive probability measure on $(H, \mathcal{B}(H))$.
By the Gross construction of "lifting", the finitely additive measure corresponding
to the isonormal field can be "extended" to a countably additive measure
on a separable Banach space.

$S(\mathbb{R}^n)$ will denote the Schwartz space of rapidly decreasing functions, and
$S'(\mathbb{R}^n)$ its dual, the space of tempered distributions. In the sequel it will be
often convenient to define a random field as a mapping $F : V \to L^0(\Omega, \mathcal{A}, \mu)$ where
$V$ is a topological space. In particular we shall have occasion to take $V = S(\mathbb{R})$.
We shall often make the assumption that the random field is determined by $f \in V$
in the sense that the smallest σ-algebra with respect to which the random variables
$\{F(f) \mid f \in V\}$ are measurable is $\mathcal{A}$. We call such a random field full.

Construction  of  the  Unit  Gaussian  Random  Field. 

Let $H$ be a separable Hilbert space and let $(e_n)$ be an orthonormal basis
for $H$. Let $\dot{\mathbb{R}}$ be the one point compactification of $\mathbb{R}$ and let $\Omega = \prod_{n=1}^\infty \dot{\mathbb{R}}$ be the
countable Cartesian product of copies of $\dot{\mathbb{R}}$. $\Omega$ is a compact Hausdorff space in
the Tychonov topology. Let $C(\Omega)$ be the set of continuous functions on $\Omega$ and
let $P(\Omega)$ be the set of functions $\phi(x_1,\dots,x_n)$ in $C(\Omega)$ which depend only on a
finite number of coordinates. With the supremum norm topology $C(\Omega)$ is a Banach space
and (by the Stone-Weierstrass theorem) $P(\Omega)$ is dense in $C(\Omega)$. For $\phi \in P(\Omega)$ define

$k(\phi) = (2\pi)^{-n/2}\int_{\mathbb{R}^n} \phi(x_1,\dots,x_n)\,e^{-\|x\|^2/2}\,dx_1 \dots dx_n$

Then $k : P(\Omega) \to \mathbb{R}$ is linear and $|k(\phi)| \le \|\phi\|$ and hence $k$ can be extended to a
continuous linear functional on $C(\Omega)$. By the Riesz Representation theorem there
exists a Borel probability measure $\mu$ on $\mathcal{B}(\Omega)$ such that $k(\phi) = \int \phi\,d\mu$, for all $\phi \in C(\Omega)$.

Let $F(e_n) : \Omega \to \dot{\mathbb{R}}$ be multiplication by $x_n$, $n = 1,2,\dots$ Then $F(e_n)$ is a.e.
in $\mathbb{R}$ and hence $F(e_n) : \Omega \to \mathbb{R}$ is measurable. It can be shown that $F(e_n) \in L^2(\Omega, \mathcal{B}(\Omega), \mu)$
and $\{F(e_n) \mid n = 1,2,\dots\}$ generate $\mathcal{B}(\Omega)$. If $f \in H$ and $f = \sum_n a_n e_n$ then $\sum_n a_n F(e_n)$
converges in $L^2(\Omega, \mathcal{B}(\Omega), \mu)$ to an element $F(f)$. This is the unit full Gaussian random
field on $H$.

4.2.2 Random Fields on $S(\mathbb{R}^n)$ and White Noise

Let $F : S(\mathbb{R}^n) \to L^0(\Omega, \mathcal{A}, P)$ be a random field. The derivative $F'$ of the random
field is defined by

$F'(f) = -F(f')$

$F'$ is a random field and if $F$ is Gaussian, $F'$ is also Gaussian. Now a random
field $F : S(\mathbb{R}^n) \to L^0(\Omega, \mathcal{A}, \mu)$ is generated by a stochastic process (assumed to have
square integrable sample paths) $X : \mathbb{R}^n \times \Omega \to \mathbb{R}$ if

$[F(f)](\omega) = \int f(x)\,X(x,\omega)\,dx$, for all $f \in S(\mathbb{R}^n)$.

Let $W(t,\omega)$, $t \in [0,\infty)$, be the standard Wiener process and let $F$ be the
Gaussian random field generated by $W(t,\omega)$. The covariance functional of $F$ can
be computed as

$C_F(f,g) = \int_0^\infty\!\int_0^\infty \min(s,t)\,f(s)\,g(t)\,ds\,dt = \int_0^\infty [\hat{f}(s) - \hat{f}(\infty)]\,[\hat{g}(s) - \hat{g}(\infty)]\,ds$

where $\hat{f}(t) = \int_0^t f(s)\,ds$ and $\hat{g}(t) = \int_0^t g(s)\,ds$.

$F'$, the derivative of $F$, exists (in the sense of distributions) and its covariance
functional is given by

$C_{F'}(f,g) = C_F(f',g') = \int_0^\infty f(t)\,g(t)\,dt = \langle f,g\rangle$

where $\langle\ ,\ \rangle$ represents the natural inner product on $S(\mathbb{R})$ embedded in $L^2(\mathbb{R})$.

$F'$ is white noise.
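The two expressions for the covariance functional, and the white noise covariance, can be compared numerically. The sketch below is an illustration under our own assumptions: the half-line is truncated at a finite upper limit, integrals are computed with the trapezoidal rule, and the test functions $e^{-t}$ and $t e^{-t}$ (rapidly decaying on $[0,\infty)$, though not Schwartz functions on $\mathbb{R}$) are chosen only because the integrals are known in closed form.

```python
import math

def trapezoid(values, step):
    """Composite trapezoidal rule on an equally spaced grid."""
    return step * (sum(values) - 0.5 * (values[0] + values[-1]))

def running_integral(values, step):
    """Cumulative trapezoidal integral, starting from 0 at the left end."""
    out = [0.0]
    for a, b in zip(values, values[1:]):
        out.append(out[-1] + 0.5 * step * (a + b))
    return out

def covariance_two_ways(f, g, upper=12.0, n=601):
    """Return (double integral with min(s,t), single integral of the tails)."""
    step = upper / (n - 1)
    grid = [i * step for i in range(n)]
    fv = [f(t) for t in grid]
    gv = [g(t) for t in grid]
    inner = [trapezoid([min(s, t) * gt for t, gt in zip(grid, gv)], step)
             for s in grid]
    double = trapezoid([fs * val for fs, val in zip(fv, inner)], step)
    f_hat = running_integral(fv, step)
    g_hat = running_integral(gv, step)
    single = trapezoid([(a - f_hat[-1]) * (b - g_hat[-1])
                        for a, b in zip(f_hat, g_hat)], step)
    return double, single
```

With $f(t) = e^{-t}$ and $g(t) = t e^{-t}$ both values are close to $3/4$. Passing the derivatives $f'$ and $g'$ instead gives values close to $\int_0^\infty f g\,dt = 1/4$, in agreement with the white noise covariance.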
4.3 Second Quantization (After Segal and Nelson)

Let $H$ be a real Hilbert space and let $F : H \to L^0(\Omega, \mathcal{A}, \mu)$ be the unit
Gaussian determined random field. If $f_1, \dots, f_n$ are orthonormal in $H$ and $\Phi$
is a bounded Baire function on $\mathbb{R}^n$, then

$\int \Phi(F(f_1), \dots, F(f_n))\,d\mu = (2\pi)^{-n/2}\int_{\mathbb{R}^n} \Phi(x)\,e^{-\|x\|^2/2}\,dx$

For concreteness $(\Omega, \mathcal{A}, \mu)$ may be chosen to be the product of countably infinitely
many copies of $(\mathbb{R}, \mathcal{B}(\mathbb{R}), (2\pi)^{-1/2} e^{-x^2/2}\,dx)$.

If $E$ denotes expectation on $(\Omega, \mathcal{A}, \mu)$ then

(4.3.1)  $E(F(f_1) \cdots F(f_{2n+1})) = 0$

(4.3.2)  $E(F(f_1) \cdots F(f_{2n})) = \sum \langle f_{i_1}, f_{j_1}\rangle \cdots \langle f_{i_n}, f_{j_n}\rangle$

where the sum is over all pairings of $1, \dots, 2n$, i.e. $i_1 < \dots < i_n$; $i_1 < j_1, \dots, i_n < j_n$
and $(i_1, j_1, \dots, i_n, j_n)$ is a permutation of $1, \dots, 2n$.
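The pairing formula is easy to evaluate by enumeration. The following sketch lists all pairings of $1, \dots, 2n$ recursively and computes the moment from the Gram matrix of the $f_i$. It is only an illustration; the function names and the Gram matrix input format are our own.

```python
def pairings(indices):
    """Yield all ways to split the list `indices` (even length) into unordered pairs."""
    if not indices:
        yield []
        return
    first, rest = indices[0], indices[1:]
    for pos, partner in enumerate(rest):
        remaining = rest[:pos] + rest[pos + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail

def gaussian_moment(gram):
    """E[F(f_1)...F(f_m)] from gram[i][j] = <f_i, f_j>, by summing over pairings."""
    m = len(gram)
    if m % 2 == 1:
        return 0  # odd moments vanish
    total = 0
    for pairing in pairings(list(range(m))):
        term = 1
        for i, j in pairing:
            term *= gram[i][j]
        total += term
    return total
```

For four copies of the same unit vector this gives 3, the fourth moment of a standard Gaussian variable, and the number of pairings of $2n$ indices is $(2n-1)!!$.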

L^(fi,A,p)  is  denoted  by  L^(H)  and  r(H)  denotes  L‘~(H).  F(H)<n  be  the  closed 
linear  span  in  F(H)  of  all  elements  of  the  form  F(f,)...F(f  )  with  m <  n  and  let 
r ( H )  denote  the  orthogonal  complement  of  r(H)<n  in  F ( H )<n -  For  fy,«,.,f  ^ 

we  define  the  Wick  polynomial 


:  F(f1)...F(fn)  : 

to  be  the  orthogonal  projection  of  F(f_)...F(f  )  into  T(h)  .  In  the  special  case, 

i  n  n  ^ 

2  — l/2  —x  / 2 

where  H  is  one  dimensional  and  hence  T(H)  =  L  (R,B(R),  (2m)  e  dx),  r(H)^ 
is  the  one  dimensional  subspace  spanned  by  the  nth  Hermite-  polynomial  and  :xn: 
is  the  nth  Hermite  polynomial  normalized  so  that  the  leading  coefficient  is  1. 

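We have the formula

(4.3.3)  $\langle :F(f_1)\cdots F(f_n):,\ :F(g_1)\cdots F(g_n): \rangle = \sum_\pi \langle f_{\pi(1)},g_1\rangle \cdots \langle f_{\pi(n)},g_n\rangle$

where the sum is over all permutations $\pi$ of $1,\ldots,n$. If all the f's and g's are equal (to a unit vector $f$), we get

(4.3.4)  $\langle :F(f)^n:,\ :F(f)^n: \rangle = \frac{1}{\sqrt{2\pi}}\int (:x^n:)^2\, e^{-x^2/2}\,dx = n!$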
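The value $n!$ in (4.3.4) can be confirmed numerically. This is a small sketch that assumes NumPy is available and identifies $:x^n:$ with the monic (probabilists') Hermite polynomial $He_n$; the helper name `wick_power_norm_sq` is hypothetical.

```python
import math

import numpy as np
from numpy.polynomial import hermite_e as He


def wick_power_norm_sq(n, quad_points=40):
    """Squared norm of :x^n: in L^2(R, (2*pi)^(-1/2) exp(-x^2/2) dx)."""
    # Gauss quadrature nodes and weights for the weight function exp(-x^2/2).
    x, w = He.hermegauss(quad_points)
    coeffs = np.zeros(n + 1)
    coeffs[n] = 1.0  # select He_n, whose leading coefficient is 1
    values = He.hermeval(x, coeffs)
    # Divide by sqrt(2*pi) to turn the weight into the unit Gaussian density.
    return float(np.sum(w * values**2) / math.sqrt(2.0 * math.pi))


norms = [wick_power_norm_sq(n) for n in range(6)]
```

The quadrature is exact for the polynomial integrands used here, so `norms` agrees with $0!, 1!, \ldots, 5!$ up to rounding.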


Let $H_{\mathbb{C}}$ be the complexification of $H$ and let $H_n$ denote the n-fold Hilbert space symmetric tensor product of $H_{\mathbb{C}}$ with itself. On $H_n$ we define the inner product such that

(4.3.5)  $\langle \mathrm{Sym}(f_1\otimes\cdots\otimes f_n),\ \mathrm{Sym}(g_1\otimes\cdots\otimes g_n)\rangle = \sum_\pi \langle f_{\pi(1)},g_1\rangle\cdots\langle f_{\pi(n)},g_n\rangle$

where

(4.3.6)  $\mathrm{Sym}(f_1\otimes\cdots\otimes f_n) = \frac{1}{n!}\sum_\pi f_{\pi(1)}\otimes\cdots\otimes f_{\pi(n)}$.

From (4.3.3) and (4.3.5), we see that the mapping $:F(f_1)\cdots F(f_n): \ \mapsto\ \mathrm{Sym}(f_1\otimes\cdots\otimes f_n)$ extends uniquely to a unitary operator from $\Gamma(H)_n$ onto $H_n$. We use this mapping to identify $\Gamma(H)_n$ and $H_n$. Analogous to the fact that the one-dimensional Hermite polynomials span $L^2(\mathbb{R},\mathcal{B}(\mathbb{R}),(2\pi)^{-1/2}e^{-x^2/2}\,dx)$, Segal proved

(4.3.7)  $\Gamma(H) = \bigoplus_{n=0}^{\infty} H_n$, for arbitrary real Hilbert space $H$.

$\Gamma(H)$ is Fock Space.

If the random field is $F(f) = \int f\,dW$, where $f \in L^2(\mathbb{R}) = H$ and $W$ is the standard Wiener process, then elements of $\Gamma(H)_n$ are multiple Wiener integrals (in the sense of Ito).

The space $\Gamma(H)$ is intrinsically attached to the structure of $H$ as a real Hilbert space. Thus if $U : H \to K$ is an orthogonal mapping of one real Hilbert space into another, it induces a unitary mapping $\Gamma(U) : \Gamma(H) \to \Gamma(K)$, where on $H_n$, $\Gamma(U) = U\otimes\cdots\otimes U$ (n-fold). Similarly if $I : H \to K$ is an isometric embedding then it induces an isometric embedding $\Gamma(I) : \Gamma(H) \to \Gamma(K)$, and similarly for an orthogonal projection $E : H \to K$.

If $A : H \to K$ is a contraction then $\Gamma(A) : \Gamma(H) \to \Gamma(K)$ is defined to be the direct sum of $\Gamma(A)_n : H_n \to K_n$, where $\Gamma(A)_n = A\otimes\cdots\otimes A$ (n-fold). Now any contraction $A : H \to K$ can be decomposed as

(4.3.8)  $A = EUI$

where $I$, $U$ and $E$ are as above.

Hence $\Gamma(A) = \Gamma(E)\Gamma(U)\Gamma(I)$. Now $\Gamma(A)$ is doubly Markovian in the sense that

$\alpha \ge 0 \Rightarrow \Gamma(A)\alpha \ge 0$
$\Gamma(A)1 = 1$
$E\,\Gamma(A)\alpha = E\alpha$.

Any doubly Markovian operator is a contraction from $L^p$ to $L^p$.

It turns out that $\Gamma(A)$ has stronger contractive properties and the precise statement of this is an important theorem of Nelson.

Theorem 4.3.1 (Nelson Hypercontractivity Theorem).

Let $A : H \to K$ be a contraction. Then $\Gamma(A)$ is a contraction from $L^q(H) \to L^p(K)$ for $1 \le q \le p \le \infty$ provided that

(4.3.9)  $\|A\| \le \left(\frac{q-1}{p-1}\right)^{1/2}$.

If (4.3.9) does not hold then $\Gamma(A)$ is not a bounded operator from $L^q(H) \to L^p(K)$.


4.4 Absolute Continuity of Gaussian Random Fields

Let $\phi$ be a Gaussian unit random field $H \to L^2(\Omega,\mathcal{A},\mu) = \Gamma(H)$. We consider the following related problems:

(1) Given $A : H \to H$ linear and continuous, when is there a unitary map $U : \Gamma(H) \to \Gamma(H)$ so that

$$U\phi(f)U^{-1} = \phi(Af) \quad \forall f \in H?$$

(2) Given $A$ as in (1), when is there an $F$ in $\Gamma(H)$ so that each $\phi(f)$ is a Gaussian random variable with respect to $|F|^2\,d\mu$ but with variance $\frac12\|Af\|^2$ instead of its variance under $\mu$?

(3) Given two Gaussian random fields with general covariance on the same Hilbert space $H$, when can they be realised on a single measure space but with two mutually absolutely continuous measures?

The above problems are essentially equivalent and we present an answer to (3) under the assumption that $A$ is positive and has a bounded inverse.

Theorem 4.4.1 (Feldman, Segal, Shale)

Let $\phi : H \to L^2(\Omega,\mathcal{A},\mu)$ be a Gaussian unit random field and let $A : H \to H$ be linear, bounded, positive with bounded inverse. Then a necessary and sufficient condition for the Gaussian random field on $H$ with variance $\frac12\|Af\|^2$ to be realizable on $(\Omega,\mathcal{A})$ with measure $\nu$ equivalent to $\mu$ is that $A - I$ be a Hilbert-Schmidt operator.

4.5 Discussion


Fock space has an important role to play in the study of the free Quantum field, which can be considered to be an infinite assembly of non-interacting harmonic oscillators.

Consider the harmonic oscillator hamiltonian $H = -\frac12\frac{d^2}{dx^2} + \frac12 x^2$ as a self-adjoint operator on $L^2(\mathbb{R})$. If instead we work with $L^2(\mathbb{R},g)$, where $g$ is Gaussian measure, then the action of $H$ on the Hermite polynomials $h_n$ is given by $h_n \mapsto n h_n$. From this it is clear that the one-parameter group $e^{itH}$ generated by $H$ acts as $h_n \mapsto e^{int}h_n$. If we denote by $W$ the unitary map $f(x) \mapsto f(x)\,[(2\pi)^{-1/2}e^{-x^2/2}]^{-1/2}$ : $L^2(\mathbb{R}) \to L^2(\mathbb{R},g)$, then the operators $p = \frac{1}{i}\frac{d}{dx}$ and $q = x$ on $L^2(\mathbb{R})$ transform to $p' = WpW^{-1}$ and $q' = WqW^{-1}$, and $\frac12(p'^2 + q'^2)$ can be identified as the Harmonic Oscillator hamiltonian. By the ideas of second quantization described in section 4.3, we can extend the one-dimensional case to an infinite dimensional setting. When this is done, this gives rise to the particle representation of the free field and the concept of a number operator.
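The statement $h_n \mapsto n h_n$ can be illustrated on polynomials. The sketch below rests on assumptions not fixed by the text: $g$ is taken to be the standard Gaussian measure, $h_n$ the monic Hermite polynomial $He_n$, and the oscillator is taken in the rescaled and shifted form in which it acts on $L^2(\mathbb{R},g)$ as the number operator $N = -\frac{d^2}{dx^2} + x\frac{d}{dx}$. The names `hermite` and `number_operator` are hypothetical.

```python
import numpy as np
from numpy.polynomial import Polynomial
from numpy.polynomial import hermite_e as He


def hermite(n):
    """Monic Hermite polynomial He_n as an ordinary power-series Polynomial."""
    coeffs = np.zeros(n + 1)
    coeffs[n] = 1.0
    return Polynomial(He.herme2poly(coeffs))


def number_operator(poly):
    """Apply N = -d^2/dx^2 + x d/dx (assumed normalization) to a polynomial."""
    x = Polynomial([0.0, 1.0])
    return -poly.deriv(2) + x * poly.deriv(1)


# Residuals N He_n - n He_n; each should be the zero polynomial.
residuals = [number_operator(hermite(n)) - n * hermite(n) for n in range(7)]
```

Under these assumptions each Hermite polynomial is an eigenvector with eigenvalue $n$, which is the one-dimensional picture behind the number operator on Fock space.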

Shale's theorem on the equivalence of Gaussian Random fields essentially shows that the Kalman filter is analogous to the free quantum field. This can be done by noting that the observations and innovations processes are related by $\nu = (I-K)y$ where $K$ is a Hilbert-Schmidt operator.

S.K. Mitter: Filtering Theory and Quantum Fields, to appear in Asterisque, 1980.

5. Some Lie Algebras of Interest in Filtering Theory.

Let us recall that in section 3 we showed that the unnormalized conditional density of $x_t$ given the observations $\{y_s \mid 0 \le s \le t\}$ satisfies a stochastic partial differential equation

(5.1)  $dq(t,x,\omega) = L_0^* q(t,x,\omega)\,dt + L_1 q(t,x,\omega)\,dy_t$

where $L_0^*$ is the formal adjoint of the generator of the diffusion process corresponding to the stochastic differential equation

(5.2)  $dx_t = f(x_t)\,dt + g(x_t)\,dw_t$,

and hence $L_0^*$ is given by

(5.3)  $L_0^* = \frac12\frac{d^2}{dx^2}\,g^2 - \frac{d}{dx}\,f$

and

(5.4)  $L_1$ = Multiplication by $h(x)$.

The Lie Algebra of operators $LA\{L_0^* - \frac12 L_1^2,\ L_1\}$ generated by $L_0^* - \frac12 L_1^2$ and $L_1$ has an important role to play in filtering theory. This comes about by writing equation (5.1) in Stratonovich form (a fact which can be justified rigorously under the hypotheses we have adopted)

(5.1)'  $\frac{\partial q}{\partial t}(t,x) = (L_0^* - \frac12 L_1^2 + \dot{y}_t L_1)\,q(t,x)$.

This suggestive form of writing is meant to show the analogy of the filtering problem with the interaction picture of quantum physics.

It turns out that a number of filtering problems give rise to Lie Algebras of operators which are representations of known algebras which arise in mathematical physics. We first summarize these known facts about some Lie Algebras arising in mathematical physics.


5.1. The Heisenberg Algebra and the Weyl Algebra.

Let $n \ge 0$, and let $(p_1,\ldots,p_n,q_1,\ldots,q_n,z)$ be a basis for a real vector space $V$. On this space we can define the structure of a Lie Algebra by defining $[p_i,q_i] = -[q_i,p_i] = z$, the other brackets between elements of the basis being zero. Let us denote this Lie Algebra by $N$. The centre of $N$ is $\mathbb{R}z$ and we have $[N,N] = \mathbb{R}z$, and hence this Lie algebra is nilpotent.

If $N$ is a Lie algebra with centre $c$ and if $c = [N,N]$ and $\dim c = 1$, then $N$ is a Heisenberg algebra.
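For $n = 1$ these relations can be realized by strictly upper triangular $3\times 3$ matrices. The sketch below uses this standard matrix model as an illustration; the particular matrices and the helper names `unit` and `bracket` are choices made here, not taken from the text.

```python
import numpy as np


def unit(i, j, size=3):
    """Matrix with a single 1 in position (i, j)."""
    m = np.zeros((size, size))
    m[i, j] = 1.0
    return m


def bracket(a, b):
    """Commutator [a, b] = ab - ba."""
    return a @ b - b @ a


# One possible model of the 3-dimensional Heisenberg algebra (n = 1).
p, q, z = unit(0, 1), unit(1, 2), unit(0, 2)

centre_ok = (np.array_equal(bracket(z, p), np.zeros((3, 3)))
             and np.array_equal(bracket(z, q), np.zeros((3, 3))))
relation_ok = np.array_equal(bracket(p, q), z)
```

Here `relation_ok` checks $[p,q] = z$ and `centre_ok` checks that $z$ is central, so every double bracket vanishes, which is the nilpotency noted above.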

Proposition 5.1

Let $N$ be a nilpotent algebra. Suppose that all characteristic ideals (1) of $N$ are of dimension $\le 1$. Then either $N$ is zero or a Heisenberg algebra.

We denote the algebra defined by $2n$ generators $p_1,q_1,\ldots,p_n,q_n$ and the bracket relations

$[p_i,q_i] = 1$,

$[p_i,q_j] = [p_i,p_j] = [q_i,q_j] = 0$ for $i \ne j$

by $A_n(\mathbb{R})$ or simply $A_n$. These are the so-called Weyl algebras.

Elements of the form $p_1^{i_1}q_1^{j_1}\cdots p_n^{i_n}q_n^{j_n}$ $(i_1,j_1,\ldots,i_n,j_n \in \mathbb{N})$ generate the vector space $A_n$.

(1) Let $g$ be a Lie algebra. A derivation of $g$ is a linear mapping $D : g \to g$ such that $D([x,y]) = [Dx,y] + [x,Dy]$ $\forall x,y \in g$. A characteristic ideal of $g$ is a subspace which is stable for all derivations of $g$.

In the vector space $E = \mathbb{R}[X_1,\ldots,X_n]$ let $P_i$ be the endomorphism $\frac{\partial}{\partial X_i}$ and $Q_i$ the endomorphism of multiplication by $X_i$. We have

$[P_i,Q_i] = 1$,

$[P_i,Q_j] = [P_i,P_j] = [Q_i,Q_j] = 0$ for $i \ne j$

and hence there exists a homomorphism $\rho : A_n \to \mathrm{End}(E)$ such that $\rho(p_i) = P_i$ and $\rho(q_i) = Q_i$ for all $i$. The elements $P_1^{i_1}Q_1^{j_1}\cdots P_n^{i_n}Q_n^{j_n}$ are linearly independent and therefore the elements $p_1^{i_1}q_1^{j_1}\cdots p_n^{i_n}q_n^{j_n}$ form a basis for the vector space $A_n$ and $\rho$ is injective. It follows that $A_n = A_1\otimes\cdots\otimes A_1$ (n-copies). The representation $\rho$ of $A_n$ in $E$ is termed the standard representation. Finally $E$ is a simple $A_n$-module and the set of $A_n$-endomorphisms of $E$ is $\mathbb{R}$.
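The defining relation of the standard representation can be tested on a concrete polynomial. A minimal sketch for $n = 1$, assuming NumPy; the function names `P`, `Q` and `commutator_PQ` are illustrative only.

```python
import numpy as np
from numpy.polynomial import Polynomial

X = Polynomial([0.0, 1.0])  # the indeterminate X_1


def P(f):
    """P = d/dX acting on a polynomial."""
    return f.deriv(1)


def Q(f):
    """Q = multiplication by X."""
    return X * f


def commutator_PQ(f):
    """[P, Q] f = P(Q f) - Q(P f); the relation [P, Q] = 1 says this equals f."""
    return P(Q(f)) - Q(P(f))


sample = Polynomial([2.0, -1.0, 0.0, 5.0])  # 2 - X + 5 X^3
defect = commutator_PQ(sample) - sample       # should be the zero polynomial
```

The check is just the product rule: $(Xf)' - Xf' = f$.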

Let $B_m$ be the set of linear combinations of $p_1^{i_1}q_1^{j_1}\cdots p_n^{i_n}q_n^{j_n} \in A_n$ such that $i_1 + j_1 + \cdots + i_n + j_n \le m$.

Then $B_m B_{m'} \subset B_{m+m'}$.

Consequently the graded algebra associated with $A_n$ equipped with the filtration $(B_0,B_1,\ldots)$ is the polynomial algebra in $2n$ variables.

Proposition 5.2

(i) $A_n$ is integral and noetherian.

(ii) The centre of $A_n$ is $\mathbb{R}$.

(iii) The algebra $A_n$ is simple.

Proposition 5.3

Let $p_1,q_1,\ldots,p_n,q_n$ be the canonical generators of $A_n$. Let us define vector subspaces $S,T$ of $A_n$ as follows:

(i) $S = \sum_{i=1}^{n} (\mathbb{R}p_i + \mathbb{R}q_i)$

(ii) $T = \sum_{1\le i,j\le n} (\mathbb{R}(p_iq_j + q_jp_i) + \mathbb{R}p_ip_j + \mathbb{R}q_iq_j)$

Then:

(i) $\mathbb{R}\oplus S$ and $T$ are Lie sub-algebras of $A_n$.

(ii) $\mathbb{R}\oplus S$ is a Heisenberg algebra.


The relationship between the Heisenberg and Weyl algebras is given by

Proposition 5.4

$$A_n(\mathbb{R}) \cong U(N)/J$$

where $U(N)$ is the enveloping algebra of the $(2n+1)$-dimensional Heisenberg Algebra $N$ and $J$ is a two-sided ideal in $U(N)$.

5.2. The Oscillator Algebra.

The real Lie algebra with $2n+2$ generators $(h,p_1,\ldots,p_n,q_1,\ldots,q_n,z)$ satisfying the bracket relations

$[h,p_i] = q_i$, $[h,q_i] = -p_i$, $[p_i,q_i] = z$, and the other brackets being zero

is defined to be the oscillator algebra. This is a solvable Lie algebra. This algebra is the semi-direct sum of the one-dimensional algebra spanned by $h$ and the Heisenberg algebra $N$.

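5.3. The Poisson Bracket Algebra.

Consider $\mathbb{R}^{2n} = \{(q,p) \mid q = (q_1,\ldots,q_n),\ p = (p_1,\ldots,p_n)\}$ with its standard symplectic structure (see later section). For $f,g \in C^\infty(\mathbb{R}^{2n})$, the Poisson bracket is given by

$$\{f,g\} = \sum_{i=1}^{n}\left(\frac{\partial f}{\partial q_i}\frac{\partial g}{\partial p_i} - \frac{\partial f}{\partial p_i}\frac{\partial g}{\partial q_i}\right).$$

The real vector space of polynomials in $(q,p)$, denoted by $P_n$, is a Lie algebra under the Poisson bracket operation.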
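To make the bracket concrete, here is a small pure-Python sketch for $n = 1$. It assumes the sign convention $\{f,g\} = f_q g_p - f_p g_q$ and represents a polynomial as a dictionary mapping exponent pairs $(a,b)$ to the coefficient of $q^a p^b$; all function names are hypothetical.

```python
from collections import defaultdict


def d_dq(f):
    """Partial derivative with respect to q."""
    return {(a - 1, b): c * a for (a, b), c in f.items() if a > 0}


def d_dp(f):
    """Partial derivative with respect to p."""
    return {(a, b - 1): c * b for (a, b), c in f.items() if b > 0}


def multiply(f, g):
    out = defaultdict(float)
    for (a1, b1), c1 in f.items():
        for (a2, b2), c2 in g.items():
            out[(a1 + a2, b1 + b2)] += c1 * c2
    return dict(out)


def subtract(f, g):
    out = defaultdict(float)
    for key, c in f.items():
        out[key] += c
    for key, c in g.items():
        out[key] -= c
    return {key: c for key, c in out.items() if c != 0}


def poisson(f, g):
    """{f, g} = f_q g_p - f_p g_q (assumed sign convention)."""
    return subtract(multiply(d_dq(f), d_dp(g)), multiply(d_dp(f), d_dq(g)))


q = {(1, 0): 1.0}
p = {(0, 1): 1.0}
h = {(0, 2): 1.0, (4, 0): 1.0}  # an illustrative quartic Hamiltonian p^2 + q^4
```

For example `poisson(q, p)` is the constant 1 and `poisson(q, h)` is $2p$; repeated brackets with a non-quadratic $h$ keep producing polynomials of higher degree, which is the mechanism behind the infinite dimensional algebras discussed in this section.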

Proposition 5.5

$P_n$ is generated by the two polynomials $q_1$ and $h = p_1^2 + q_1^4$ if $n = 1$, and by $q_1$ and $h = \sum_{i=1}^{n}(p_i^2 + q_i^4) + \sum_{i=1}^{n-1} q_i q_{i+1}$ if $n > 1$. That is, $P_n$ coincides with the smallest Lie subalgebra containing $q_1$ and $h$.

Proposition 5.6

$P_n/\mathbb{R}$ is simple.

5.4 Examples of Non-linear Filtering Problems and Their Lie Algebras
Example 1.

Consider the non-linear filtering problem

(5.5)  $dx_t = f(x_t)\,dt + dw_t$
       $dy_t = h(x_t)\,dt + d\eta_t$

and let us assume that $f, h \in C^\infty(\mathbb{R})$. In the light of section 3, the Mortensen-Zakai equation is

$$dq_t = (L_0^* - \tfrac12 L_1^2)\,q_t\,dt + L_1 q_t \circ dy_t$$

where $L_0^* = \frac12\frac{d^2}{dx^2} - \frac{d}{dx}f$ and $L_1$ = multiplication by $h(x)$.

We then have

Proposition 5.7

$LA\{L_0^* - \frac12 L_1^2,\ L_1\}$ is finite dimensional only in the case

(i) $h = \alpha x + \beta$,

(ii) $f_x + f^2 = ax^2 + bx + c$,

where the Lie algebra of operators is computed on the common domain $C^\infty(\mathbb{R})$ or $S(\mathbb{R})$.

In the above, if we want the diffusion process to be defined globally on $\mathbb{R}$, we assume that the Riccati equation $f_x + f^2 = ax^2 + bx + c$ has a global solution.

The example covered by Proposition 5.7 has recently been considered by Benes. From Section 3.1, we can see that we can remove the effect of the drift by a Gauge transformation. From (3.15) we see that it is enough to solve

$$d\tilde{q}_t = L_0\tilde{q}_t\,dt - V(x)\tilde{q}_t\,dt + h\tilde{q}_t \circ dy_t .$$

But $LA\{L_0^* - \frac12 L_1^2,\ L_1\} \cong LA\{L_0 - V(x),\ L_1\}$, and under the hypotheses of Proposition 5.7 the second Lie algebra has a basis consisting of the elements

$$L_0 - \tfrac12 L_1^2,\quad L_1,\quad [L_0 - \tfrac12 L_1^2,\ L_1] = L_2 = \alpha\frac{d}{dx},\quad [L_2,\ L_1] = \alpha^2 I .$$

This however is essentially the Harmonic Oscillator algebra and corresponds (essentially) to the filtering problem

$$dy_t = x_t\,dt + d\eta_t$$

which is a Kalman filtering problem.
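The closure of the oscillator algebra can be verified on polynomials. The sketch below assumes the simplest instance, a Brownian state with $h(x) = x$, so that the generators are $A = \frac12\frac{d^2}{dx^2} - \frac12 x^2$ and $L_1 = x$; these specific choices, and the helper names, are assumptions made for illustration.

```python
import numpy as np
from numpy.polynomial import Polynomial

X = Polynomial([0.0, 1.0])


def A(f):
    """A = (1/2) d^2/dx^2 - (1/2) x^2 (assumed: no drift, h(x) = x)."""
    return 0.5 * f.deriv(2) - 0.5 * X * X * f


def L1(f):
    """Multiplication by x."""
    return X * f


def L2(f):
    """d/dx."""
    return f.deriv(1)


def commutator(S, T):
    """Return the operator [S, T] = ST - TS."""
    return lambda f: S(T(f)) - T(S(f))


def same(u, v):
    """True if two polynomials agree up to rounding."""
    return bool(np.allclose((u - v).coef, 0.0))


f = Polynomial([1.0, 2.0, -3.0, 0.5])
closes = (same(commutator(A, L1)(f), L2(f))      # [A, x] = d/dx
          and same(commutator(L2, L1)(f), f)     # [d/dx, x] = I
          and same(commutator(A, L2)(f), L1(f)))  # [A, d/dx] = x
```

Since every bracket of $A$, $x$, $d/dx$ and $I$ lands back in their span, the Lie algebra is four dimensional in this case.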

It should also be remarked that there is no difficulty in extending this example to the multi-dimensional situation

$dx_t = f(x_t)\,dt + dw_t$
$dy_t = Hx_t\,dt + d\eta_t$,

provided $f : \mathbb{R}^n \to \mathbb{R}^n$ is a gradient, $f = \nabla F = (\frac{\partial F}{\partial x_1},\ldots,\frac{\partial F}{\partial x_n})'$, and $H$ is a linear map on $\mathbb{R}^n$. This fact is also clear from our considerations relating Schrodinger and Dirichlet operators. Finally, this problem is the analogue of the imaginary time Harmonic Oscillator problem with an external force.

Example 2.

Consider the non-linear filtering problem

(5.6)  $x_t = w_t$
       $dy_t = x_t^3\,dt + d\eta_t$.

This is the so-called cubic sensor problem. From Proposition 5.7, $LA\{\frac12\frac{d^2}{dx^2} - \frac12 x^6,\ x^3\}$ is infinite dimensional. From the calculations required to establish Proposition 5.7 it can be shown that the Lie algebra is isomorphic to the Weyl algebra, which is simple.


Example 3.

Consider the model given by (5.5) and suppose that $f_x + f^2 = V(x)$ has a global solution, where $V(x)$, say, is an even positive polynomial (other than the quadratic). Then from Proposition 5.7, $LA\{L_0^* - \frac12 L_1^2,\ L_1\}$ is infinite dimensional. Even more, these Lie algebras (modulo their centre, which is $\mathbb{R}$) are simple.

This fact follows by doing calculations similar to those involved in establishing that the Poisson bracket algebra is simple.


Example 4.

Let us consider the model of (5.5) and let $f, h \in C^\infty(U)$ where $U$ is some open set in $\mathbb{R}$. Then by restricting $L_0^* - \frac12 L_1^2$ and $L_1$ to $C^\infty(U)$, the Lie algebra $LA\{L_0^* - \frac12 L_1^2,\ L_1\}$ is finite dimensional only in the (prototype) case

(i) $h = x$

(ii) $f_x + f^2 = -$

Note that the last equation (Riccati) has finite escape time.

Suppose we consider the slightly more general model

$dx_t = f(x_t)\,dt + g(x_t)\,dw_t$, where $f, g \in C^\infty(U)$,

$U$ open in $\mathbb{R}$ and $g(x) > 0$, $x \in U$. In this case we can remove the diffusion term $g$ by a non-linear smooth change of coordinates.

Let $x_t = \alpha(z_t)$ where $\alpha$ is smooth and invertible. Then by the Ito differential rule

$$dx_t = \alpha_z\,dz_t + \tfrac12\alpha_{zz}\,dt = f(\alpha(z_t))\,dt + g(\alpha(z_t))\,dw_t .$$

Hence

$$dz_t = (\alpha_z)^{-1}\big(f(\alpha(z_t)) - \tfrac12\alpha_{zz}(z_t)\big)\,dt + (\alpha_z)^{-1}g(\alpha(z_t))\,dw_t .$$

Let $\alpha$ satisfy the differential equation

$$\frac{d\alpha}{dz} = g(\alpha(z)),$$

and let $I$ be the maximal interval in $\mathbb{R}$ on which the solution is defined. Then

$$dz_t = \big(g(\alpha(z_t))^{-1}f(\alpha(z_t)) - \tfrac12 g_\alpha(\alpha(z_t))\big)\,dt + dw_t = \tilde{f}(z_t)\,dt + dw_t .$$

The observation equation is

$$dy_t = \tilde{h}(z_t)\,dt + d\eta_t \quad\text{where } \tilde{h}(z) = h(\alpha(z)).$$
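As a worked illustration of this change of coordinates (an example chosen here, not one from the text), take $U = (0,\infty)$, $f(x) = \mu x$ and $g(x) = \sigma x$ with constants $\mu$ and $\sigma > 0$. The equation $d\alpha/dz = g(\alpha)$ can then be solved in closed form and the transformed drift is constant:

```latex
\begin{aligned}
\frac{d\alpha}{dz} &= \sigma\alpha
  \;\Longrightarrow\; \alpha(z) = e^{\sigma z}, \qquad I = \mathbb{R},\\[2pt]
\tilde f(z) &= g(\alpha(z))^{-1} f(\alpha(z)) - \tfrac12\, g_\alpha(\alpha(z))
  = \frac{\mu e^{\sigma z}}{\sigma e^{\sigma z}} - \frac{\sigma}{2}
  = \frac{\mu}{\sigma} - \frac{\sigma}{2},\\[2pt]
dz_t &= \Big(\frac{\mu}{\sigma} - \frac{\sigma}{2}\Big)\,dt + dw_t,
  \qquad \tilde h(z) = h\big(e^{\sigma z}\big).
\end{aligned}
```

This agrees with applying the Ito rule directly to $z_t = \sigma^{-1}\log x_t$, which is a useful consistency check on the general formula for $\tilde{f}$.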


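Now it is easy to see that

$$LA\Big\{\tfrac12\tfrac{d^2}{dx^2}\,g^2 - \tfrac{d}{dx}\,f - \tfrac12 h^2,\ h\Big\} \cong LA\Big\{\tfrac12\tfrac{d^2}{dz^2} - \tfrac{d}{dz}\,\tilde{f} - \tfrac12\tilde{h}^2,\ \tilde{h}\Big\}$$

where the first Lie algebra is computed on the common invariant domain $C^\infty(U)$ and the second on the common invariant domain $C^\infty(I)$.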

5.5 Significance of the Results

If the Lie Algebra $LA\{L_0^* - \frac12 L_1^2,\ L_1\}$ is finite dimensional, then it should be possible to construct the filter by integrating the Lie algebra. We consider this aspect of the problem in the next section. If the Lie algebra is infinite dimensional and simple then we conjecture that it cannot be represented by a Lie algebra of vector fields with analytic coefficients. That is, we cannot represent the solution $q(t,z,y_0^t)$ of the Mortensen-Zakai equation by means of a finite dimensional sufficient statistic described by a vector differential equation

(5.7)  $d\alpha_t = a(\alpha_t)\,dt + b(\alpha_t)\,dy_t$.

Suppose we are interested only in computing unnormalized conditional statistics $\phi(x_t)$. If we denote this by $\hat\phi_t$ then

$$\hat\phi_t = \int_{\mathbb{R}} \phi(z)\,q(t,z,y_0^t)\,dz .$$

It may be possible to represent $\hat\phi_t$ as the output of a vector differential system:

(5.8)  $d\alpha_t = a(\alpha_t)\,dt + b(\alpha_t)\,dy_t$
       $\hat\phi_t = c(\alpha_t)$.

A theorem of Brockett says that for this to be possible there must exist a homomorphism between the Lie algebra of operators $LA\{L_0^* - \frac12 L_1^2,\ L_1\}$ and the Lie algebra of vector fields generated by $a - \frac12 b_\alpha b$ and $b$ such that under the homomorphism $L_0^* - \frac12 L_1^2 \mapsto a - \frac12 b_\alpha b$ and $L_1 \mapsto b$. This would suggest that the ideal structure of the Lie Algebra $L = LA\{L_0^* - \frac12 L_1^2,\ L_1\}$ should be important. If there exist non-trivial ideals $I$ then a candidate for such a homomorphism would be the canonical projection $L \to L/I$. The results of this section show that for a large class of problems this line of attack will not be possible, since the Lie algebra $L$ is infinite dimensional and simple.

A possible approach might be to represent $\hat\phi_t$ as the output of a delay system, that is, look for a representation of the form

$d\alpha_t = a(\alpha_t,\alpha_{t-\theta})\,dt + b(\alpha_t,\alpha_{t-\theta})\,dy_t$

$\hat\phi_t = \int c(\alpha_t,\alpha_{t-\theta})\,d\theta$.

There is some reason to believe that it might be possible to do this for the cubic sensor problem considered in Example 2.

Notes and References for Section 5

(i) The idea of studying the Lie Algebra of operators involved in the Mortensen-Zakai equation is independently due to R.W. Brockett and myself. I was motivated by the analogy between the Kalman filter and the Free Quantum field and I discussed these ideas in a series of lectures in the University of Maryland in December 1977. In this connection see: R.W. Brockett: Remarks on Finite Dimensional Non-linear Estimation, presented in the Conference on Algebraic and Geometric Methods in System Theory, Bordeaux, France, September 1978, to appear in Asterisque, 1980.

S.K. Mitter: Filtering Theory and Quantum Fields, presented in the Conference on Algebraic and Geometric Methods in System Theory, Bordeaux, France, September 1978, to appear in Asterisque, 1980, and the references cited in the above two papers.

(ii) For the material on Heisenberg and Weyl Algebras see: J. Dixmier: Algebres Enveloppantes, Gauthier-Villars, Paris, 1974.

Propositions 5.5 and 5.6 on the Poisson Bracket Algebra are due to A. Avez and A. Heslot: L'algebre de Lie des Polynomes en les Coordonnees Canoniques munie du Crochet de Poisson, C.R. Acad. Sc. Paris t. 288 Serie A, Mai 1979, pp. 831-833.

(iii) Proposition 5.7 is due to Daniel Ocone and is related to non-linear filtering problems first introduced by V. Benes. See D. Ocone: Doctoral Dissertation, Mathematics Department, M.I.T., June 1980 and V. Benes: paper to appear in Stochastics, 1980.

(iv) The content of Examples 2 and 3 is new and due to Daniel Ocone and the author. The content of Example 4 is due to Daniel Ocone (forthcoming Ph.D. dissertation) and R.W. Brockett: loc. cit., paper in Decision and Control Conference 1979.

(v) The reference to the theorem of Brockett cited in section 5.5 is contained in R.W. Brockett: loc. cit.

(vi) Recent work (unpublished) by John Baras as well as M. Hazewinkel and D. Marcus shows that the bilinear filtering has a nice ideal structure and many Alterations.

6. Representation of the Filter

The complete solution of the non-linear filtering problem requires computing the propagator of the Mortensen-Zakai equation or equivalently the unnormalized conditional transition density $q(t,z,y_s^t \mid s,x)$. We have discussed several approaches to computing the density $q(t,z,y_0^t)$. For the class of problems considered by Benes the propagator could be computed by solving the two-point boundary value problem

d^T  =  dx  ;  fs  =  X  ,  =  2 

dnT  =  -CTdx  -  V^.(CT)dx  +  dyT 

according  to  the  development  of  section  3.2.  This  is  the  approach  of  functional 
integration. 

The  Lie-algebra  viewpoint  to  computing  the  propagator  would  require  that  the 
Lie  algebra  be  integrable.  This  is  a  difficult  question  as  we  can  see  by  discussing 
the  commutation  relationship  of  quantum  mechanics. 

6.1 Canonical Commutation Relations and their Unitary Representations.

Consider a massive spinless, nonrelativistic particle. Its configuration space is typically $L^2(\mathbb{R})$. The position operator $q : L^2(\mathbb{R}) \to L^2(\mathbb{R})$ is defined as the operator with domain

(6.1)  $D(q) = \{f \in L^2(\mathbb{R}) \mid xf(x) \in L^2(\mathbb{R})\}$ and $(qf)(x) = x f(x)$.

The momentum operator $p : L^2(\mathbb{R}) \to L^2(\mathbb{R})$ has domain

(6.2)  $D(p) = \{f \in L^2(\mathbb{R}) \mid f' \in L^2(\mathbb{R})\}$ and $(pf)(x) = \frac{1}{i}\frac{df}{dx}$.

The Schwartz space $S(\mathbb{R}) \subset D(q)\cap D(p)$, $S(\mathbb{R})$ is dense in $L^2(\mathbb{R})$ and is left invariant under $q$ and $p$. Moreover on $S(\mathbb{R})$ they satisfy the Heisenberg Commutation Relations

(6.3)  $qp - pq = iI$.

Since $p$ is self-adjoint, the operator $U(a) = e^{-iap}$, $a \in \mathbb{R}$, is unitary and the operators $\{U(a) \mid a \in \mathbb{R}\}$ form a one-parameter strongly continuous unitary group. Now $D(q)$ is invariant under $U(a)$, $a \in \mathbb{R}$, and it can be shown that

(6.4)  $U(a)\,q\,U(-a) = q - aI$ on $D(q)$.

This is the Schrodinger form of the Canonical Commutation Relations.

Since $q$ is self-adjoint it generates a strongly continuous one-parameter unitary group $V(b)$ given by

(6.5)  $V(b)f(x) = e^{-ibx}f(x)$.

It can be shown that

(6.6)  $U(a)V(b) = e^{iab}\,V(b)U(a)$.

This is done by first checking it on $S(\mathbb{R})$ and then extending by continuity to all of $L^2(\mathbb{R})$.

(6.6) represents the Weyl Form of the Commutation Relations.
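The Weyl relation (6.6) can be checked pointwise on a sample function. This sketch assumes the conventions $p = \frac{1}{i}\frac{d}{dx}$, so that $U(a) = e^{-iap}$ acts as the translation $f(x) \mapsto f(x-a)$, and $V(b)f(x) = e^{-ibx}f(x)$; the names `U`, `V` and `weyl_defect` are illustrative.

```python
import numpy as np


def U(a):
    """U(a) = exp(-i a p) with p = (1/i) d/dx, i.e. translation by a."""
    return lambda f: (lambda x: f(x - a))


def V(b):
    """V(b) = multiplication by exp(-i b x)."""
    return lambda f: (lambda x: np.exp(-1j * b * x) * f(x))


def weyl_defect(f, a, b, xs):
    """Largest pointwise gap between U(a)V(b)f and exp(iab) V(b)U(a)f on xs."""
    lhs = U(a)(V(b)(f))(xs)
    rhs = np.exp(1j * a * b) * V(b)(U(a)(f))(xs)
    return float(np.max(np.abs(lhs - rhs)))


def gaussian(x):
    """A Schwartz-class test function."""
    return np.exp(-x**2)


grid = np.linspace(-3.0, 3.0, 61)
defect = weyl_defect(gaussian, a=0.7, b=1.3, xs=grid)
```

The defect vanishes up to rounding because translating $e^{-ibx}f(x)$ by $a$ produces exactly the extra phase $e^{iab}$.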


It is a well known fact that the Heisenberg form of the canonical commutation relations does not give rise to a unitary group representation, while the Schrodinger and Weyl forms do. This gives rise to the following general question: Suppose we have a representation of a Lie algebra $g$ by skew-symmetric operators defined on a common invariant domain $\mathcal{D}$ in a Hilbert space $H$ and let $G$ be the simply connected Lie group with Lie algebra $g$. When does the representation of $g$ come from a unitary representation of $G$?

The answer to this question is connected with questions of essential self-adjointness and the existence of a common dense set of analytic vectors. We discuss some of these questions now.

Let $G$ be a Lie group and $X$ a Banach space. A representation $T$ of $G$ is a mapping $G \to L(X) : a \mapsto T(a)$, where $L(X)$ is the set of bounded operators, such that $T(e) = I$, $e$ being the identity element, and $T(a_1a_2) = T(a_1)T(a_2)$ for all $a_1, a_2 \in G$, and $\forall x \in X$, $a \mapsto T(a)x$ is continuous (with $X$ in the norm topology).

The representation is called unitary if $X$ is a Hilbert space and each $T(a)$ is a unitary operator.

A vector $x \in X$ is an analytic vector for $T$ in case the mapping $a \mapsto T(a)x : G \to X$ is analytic.

The salient facts connecting representations and analytic vectors are:

Theorem 6.1: Let $T$ be a representation of a Lie group $G$ on a Banach space $X$. Then $T$ has a dense set of analytic vectors in $X$.

The  answer  to  the  question  raised  earlier  in  the  section  is  contained  in 
the  following  theorem  and  corollary  due  to  Nelson: 

Theorem 6.2: Let $g$ be a Lie algebra of skew symmetric operators on a Hilbert space $H$ having a common invariant domain $\mathcal{D}$. Let $X_1,\ldots,X_d$ be a basis for $g$, $A = X_1^2 + \cdots + X_d^2$. If $A$ is essentially self-adjoint, then there is on $H$ a unique unitary representation $U$ of the simply connected Lie Group $G$ having $g$ as its Lie algebra such that for all $X$ in $g$, $\overline{U(X)} = \overline{X}$ (bar denotes closure of an operator).

Corollary 6.3: Let $g$ be a real Lie Algebra, $H$ a Hilbert space. For each $X$ in $g$ let $\rho(X)$ be a skew-symmetric operator on $H$. Let $\mathcal{D}$ be a dense linear subspace of $H$ such that for all $X$, $Y$ in $g$, $\mathcal{D}$ is contained in the domain of $\rho(X)\rho(Y)$. Suppose that for all $X$, $Y$ in $g$, $x$ in $\mathcal{D}$, and real numbers $a$ and $b$,

$\rho(aX + bY)x = a\rho(X)x + b\rho(Y)x$

$\rho([X,Y])x = (\rho(X)\rho(Y) - \rho(Y)\rho(X))x$.

Let $X_1,\ldots,X_d$ be a basis for $g$. If the restriction $A$ of $\rho(X_1)^2 + \cdots + \rho(X_d)^2$ to $\mathcal{D}$ is essentially self-adjoint, then there is on $H$ a unique unitary representation $U$ of the simply connected Lie group $G$ having $g$ as its Lie algebra such that for all $X$ in $g$, $\overline{U(X)} = \overline{\rho(X)}$.

Corollary 6.4: Let $g$ be a real Lie algebra with a basis $X_1,\ldots,X_d$, $G$ the simply connected Lie group with Lie algebra $g$, $H$ a Hilbert space, $C$ a dense linear subspace of $H$. Let $\rho$ be a representation of $g$ by skew-symmetric operators with domain $C$. Then there is a unitary representation $U$ of $G$ such that $C$ is the space of infinitely differentiable vectors for $U$ and $U(X) = \rho(X)$ for all $X$ in $g$ if and only if

$$A = \rho(X_1)^2 + \cdots + \rho(X_d)^2$$

is essentially self-adjoint and $C = \bigcap_{n=1}^{\infty} D(\overline{A}^{\,n})$.

How do these ideas relate to the representation of the filter? Firstly, the equation we are dealing with is a stochastic parabolic equation valid for $t \ge 0$. Hence the operators $L_0$ and $L_1$ will in general only generate semi-groups. Consider the Kalman filtering problem (or any problem Gauge equivalent to it). Then what is necessary is to give a precise meaning to the time-ordered operator product

Vo  Vi  V?  h 

e  e  e  e 


t.  >  0 
l  — 


as  an  evolution  operator  (in  the  sense  of  Kato). 


For the Kalman filtering problem this can be done using special methods, but there are examples, like the estimation of a Bessel process in additive white noise, where this appears not to be possible. To see the connection to unitary representations of Lie groups, it might be best to complexify and try to check the conditions of Nelson's theorem. We conjecture that if the Lie algebra representation does not extend to a unitary group representation then we shall not be able to give meaning to the time-ordered operator product considered above.

Finally,  the  most  direct  way  appears  to  be  to  try  to  integrate  equation  (3.15) 
of  Section  3 


(L0  "  I  LX 


ytVt  “ 


2  ~ 

ytVt 


but the question of integrating the Lie algebra also appears here.

Notes and References for Section 6

(i) For a detailed account of group representation theory relevant to this section see A. O. Barut and R. Raczka: Theory of Group Representations, PWN-Polish Scientific Publishers, Warsaw 1977. The exposition given here closely follows: E. Nelson, Analytic Vectors, Annals of Mathematics, 70, 1959, pp. 572-615.

(ii) The details of treating the Benes and similar problems using Group Invariance and Lie Algebraic ideas will appear in a joint paper by J. Baras, S.K. Mitter, D. Ocone. See also D. Ocone, forthcoming Doctoral Dissertation, M.I.T., June 1980.